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OM protein - protein search, using sw model 



Run on: January 22, 2 004, 16:31:15 ; Search time 70.555 Seconds 

(without alignments) 
2650.131 Million cell updates/sec 

Title: US-09-830-972-29 
Perfect score: 5923 

Sequence: 1 MEDLDQSPLVSSSDSPPRPQ VKDAMAKI QAKI PGLKRKAE 1178 



Scoring table: BLOSUM62 

Gapop 10.0 , Gapext 0.5 



Searched: 1107863 seqs, 158726573 residues 

Total number of hits satisfying chosen parameters: 1107863 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 



Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 



Database : A_Geneseq_19 Jun03 : * 

1 : /SIDSl/gcgdata/geneseq/geneseqp-embl/AA198 0 . DAT : * 

2 : /SIDSl/gcgdata/geneseq/geneseqp-embl/AA1981 . DAT: * 

3 : /SIDSl/gcgdata/geneseq/geneseqp-embl/AA1982 .DAT:* 

4 : /SIDSl/gcgdata/geneseq/geneseqp-embl/AA1983 .DAT:* 

5 : / SIDS1/ gcgda t a / genes eq/ genes eqp - embl /AA1 984. DAT : * 

6 : /SIDSl/gcgdata/geneseq/geneseqp-embl/AA1985 . DAT: * 

7 : / S I DS 1 /gcgda t a /genes eq/ genes eqp - embl /AA1 98 6. DAT : * 

8 : /SIDSl/gcgdata/geneseq/geneseqp-embl/AA1987 . DAT: * 

9 : /SIDSl/gcgdata/geneseq/geneseqp-embl/AA1988 . DAT: * 
10 : /SIDSl/gcgdata/geneseq/geneseqp-embl/AA1989 .DAT: * 
11 : / SIDSl/ gcgda t a /genes eq/ genes eqp- embl /AA1990 . DAT : * 
12 : /SIDSl/gcgdata/geneseq/geneseqp-embl/AA19 91 .DAT: * 
13 : / SIDSl/gcgdata/geneseq/geneseqp-embl/AA1992 .DAT: * 
14 : /SIDSl/gcgdata/geneseq/geneseqp-embl/AA19 93 .DAT: * 
15 : /SIDSl/gcgdata/geneseq/geneseqp-embl/AA1994 .DAT: * 
16 : / SIDSl/gcgdata/geneseq/geneseqp-embl/AA1995 . DAT: * 
17 : /SIDSl/gcgdata/geneseq/geneseqp-embl/AA1996 .DAT: * 
18 : /SIDSl/gcgdata/geneseq/geneseqp-embl/AA1997 .DAT: * 
19 : /SIDSl/gcgdata/geneseq/geneseqp-embl/AA1998 .DAT: * 
20 : /SIDSl/ gcgda ta/ genes eq/ genes eqp-embl/AA1999 . DAT : * 
21 : / SIDSl/gcgdata/geneseq/geneseqp-embl/AA2000 . DAT: * 
22 : /SIDSl/gcgdata/geneseq/geneseqp-embl/AA2001 .DAT: * 
23 : /SIDSl/gcgdata/geneseq/geneseqp-embl/AA2002 .DAT: * 
24 : /SIDSl/gcgdata/geneseq/geneseqp-embl/AA2003 .DAT: * 

Pred. No. is the number of results predicted by chance to have a 

score greater than or equal to the score of the result being printed, 



and is derived by analysis of the total score distribution. 
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ALIGNMENTS 



RESULT 1 




AAY71311 




ID 


AAY71311 standard; Protein; 1178 AA. 


XX 






AC 


AAY71311; 




XX 






DT 


02-NOV-2000 (first entry) 


XX 






DE 


Human neurite growth inhibitor Nogo. 


XX 






KW 


Human; neurite growth inhibitor; Nogo; neural cell; myelin; CNS; 


KW 


central nervous 


system; neoplastic disease; antiproliferative; glioma; 


KW 


antisense gene therapy; neuroblastoma; menagioma; retinoblastoma; 


KW 


degenerative nerve disease; Alzheimer's disease; Parkinson's disease; 


KW 


hyperprol iterative disorder; benign dysprol iterative disorder; diagnos 


KW 


psoriasis; tissue hypertrophy; neuronal regeneration; treatment; 


KW 


structural plasticity; screening. 


XX 






OS 


Homo sapiens. 




XX 






FH 


Key 


Location/Qualif iers 


FT 


Misc-dif f erence 


187 


FT 




/label= Unknown 


FT 


Misc-dif f erence 


188 


FT 




/label= Unknown 


FT 


Misc-diff erence 


189 


FT 




/ label = Unknown 


FT 


Misc-dif f erence 


190 


FT 




/label= Unknown 


FT 


Misc-dif f erence 


221 


FT 




/label= Unknown 


FT 


Misc-dif f erence 


328 


FT 




/label= Unknown 


FT 


Misc-dif f erence 


477 


FT 




/labels Unknown 


FT 


Region 


994. .1174 


FT 




/note= "Region specifically described in claim 16" 


FT 


Region 


977 . . 1012 


FT 




/note- "Region specifically described in claim 16" 


FT 


Region 


1079. .1114 


FT 




/note= "Region specifically described in claim 16" 


XX 






PN 


WO200031235-A2. 




XX 






PD 


02-JUN-2000. 




XX 






PF 


05-NOV-1999; 99WO-US26160 . 


XX 






PR 


06-NOV-1998; 98US-0107446 . 


XX 






PA 


(SCHW/) SCHWAB M 


E. 


PA 


(CHEN/) CHEN M S 




XX 






PI 


Schwab ME, Chen 


MS; 


XX 






DR 


WPI; 2000-400052/34. 


XX 







PT Nogo proteins and nucleic acids useful for treating neoplastic 

PT disorders of the central nervous system and inducing regeneration of 

PT neurons - 

XX 

PS Claim 11; Fig 13; 122pp; English. 
XX 

CC The present sequence is a human Nogo protein which is a 

CC potent neural cell growth inhibitor and is free of all central nervous 

CC system (CNS) myelin material with which it is natively associated. The 

CC human Nogo sequence was derived by aligning human expressed sequence tags 

CC (ESTs) e.g. AA158636, AA333267, AA081783, AA167765, AA322918, AA092565, 

CC AA081525 and AA081840 with the rat Nogo sequence. 

CC Nogo proteins and fragments displaying neurite growth inhibitory 

CC activity are used in the treatment of neoplastic disease of the CNS 

CC e.g. glioma, glioblastoma, medulloblastoma, craniopharyngioma, ependyoma, 

CC pinealoma, haemangioblastoma , acoustic neuroma, oligodendroglioma, 

CC menagioma, neuroblastoma or retinoblastoma and degenerative nerve 

CC diseases e.g. Alzheimer's and Parkinson's diseases. Therapeutics which 

CC promote Nogo activity can be used to treat or prevent hyperprol if erative 

CC or benign dysprol if erative disorders e.g. psoriasis and tissue 

CC hypertrophy. Ribozymes or ant i sense Nogo nucleic acids can be used to 

CC inhibit production of Nogo protein to induce regeneration of neurons or 

CC to promote structural plasticity of the CNS in disorders where neurite 

CC growth, regeneration or maintenance are deficient or desired. 

CC The animal models can be used in diagnostic and screening methods for 

CC predisposition to disorders and to screen for or test molecules which 

CC can treat or prevent disorders or diseases of the CNS. 

CC Note: SEQ ID numbers 35-42 are referred in claim 32 and SEQ ID NO: 29 

CC in disclosure of the specification. However the specification does not 

CC include sequences for these SEQ ID numbers. 

XX 

SQ Sequence 1178 AA; 



Query Match 99.3%; Score 5882; DB 21; Length 1178; 

Best Local Similarity 99.7%; Pred. No. 1.8e-297; 

Matches 1175; Conservative 0; Mismatches 3; Indels 0; Gaps 0; 
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960 


Db 


901 


IIMMIIIIMIIMMIIIIIIIIIIMIIIMIIMIIIIIIIIIIMIIIIMIII 
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Db 
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■ - ■ ^ 
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Qy 
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Db 
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Qy 
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ALGHVNCTI KELRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGLTLLI LALI SLFSVPVI 
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Db 


1081 


IIIIMIIIIIIIIMIIIIIIIIIIIIIIIIIMIMIIIIIIMIMIIIIIIIIIM 

ALGHVNCTI KELRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGLTLLILALI SLFSVPVI 


1140 



Qy 



1141 YERHQAQIDHYLGLANKNVKDAMAKIQAKI PGLKRKAE 1178 



IIIIIIIIIIIMIIIillllllllllllllllllMI 

Db 1141 YERHQAQ I DHYLGLANKNVKDAMAKI QAKI PGLKRKAE 1178 



RESULT 2 
AAU04591 

ID AAU04591 standard; Protein; 1192 AA. 
XX 

AC AAU04591; 
XX 

DT 26-SEP-2001 (first entry) 
XX 

DE Human Nogo protein. 
XX 

KW Human; Nogo receptor; axonal growth; immunogen; antibody; nogo protein ; 

KW cranial trauma; cerebral trauma; spinal cord injury; stroke; 

KW demyelinating disease; multiple sclerosis; monophasis demyelination; 

KW encephalomyelitis; multifocal leukoencephalopathy ; panencephalitis; 

KW Marchiafava-Bignami disease; pontine myelinolysis ; adreno leukodystrophy; 

KW Pelizaeus-Merzbacher disease; Spongy degeneration; Alexander's disease; 

KW Canavan's disease; metachromatic leukodystrophy; viral infection; 

KW Krabbe's disease. 

XX 

OS Homo sapiens. 



XX 

FH Key Location/Qualifiers 

FT Domain 1054.. 1119 

FT /labels Lumenal_extracellular_domain 

FT /note= "This sequence is specifically claimed" 

FT Peptide 1055.. 1094 ^ 

FT /label= Pepl 

FT /note= "Receptor binding inhibitory peptide. This 

FT sequence is specifically claimed" 

FT Peptide 1064.. 1088 

FT /label= Pep2 

FT /note= "Receptor binding inhibitory peptide. This 

FT sequence is specifically claimed" 

FT Peptide 1074 . . 1098 

FT /label= Pep3 

FT /note- "Receptor binding inhibitory peptide. This 

FT sequence is specifically claimed" 

FT Peptide 1084 . . 1108 

FT /label- Pep4 

FT /note= "Receptor binding inhibitory peptide. This 

FT sequence is specifically clY 18550X- 

FT Peptide 1095.. 1119 

FT /label = Pep5 

FT /note= "Receptor binding inhibitory peptide. This 

FT sequence is specifically claimed" 

XX 



PN WO200151520-A2 . 
XX 

PD 19-JUL-2001. 
XX 

PF 12-JAN-2001; 2001WO-US01041 . 
XX 

PR 12-JAN-2000; 2000US-0175707 . 



PR 26-MAY-2000; 2000US-0207366 . 

PR 29-SEP-2000; 2000US-0236378 . 
XX 

PA (UYYA ) UNIV YALE . 
XX 

PI Strittmatter SM; 
XX 

DR WPI; 2001-442138/47. 

DR N-PSDB; AAS09453 . 
XX 

PT Novel Nogo receptor protein useful for identifying modulator of Nogo 

PT protein or Nogo receptor protein, which is useful for treating central 

PT nervous system disorders 
XX 

PS Example 1; Page 101-104; 109pp; English. 
XX 

CC The sequence is the human Nogo protein, a 250kDa myelin-associated axon 

CC growth inhibitor. The invention relates to the use of the nogo receptor, 

CC nogo protein, their nucleic acids, vectors expressing them and antibodies 

CC against them, to isolate agents which block nogo receptor mediated axonal 

CC growth. The agent is useful for treating a central nervous system 

CC disorder which is a result of cranial or cerebral uma, spinal cord 

CC injury, stroke or a demyelinat ing disease selected from multiple 

CC sclerosis, monophasis demyel ination, encephalomyelitis, multifocal 

CC leukoencephalopathy, panencephalitis, Marchiaf ava-Bignami disease, 

CC pontine myelinolysis , adrenoleukodys trophy, Pelizaeus-Merzbacher disease, 

CC Spongy degeneration, Alexander's disease, Canavan's disease, 

CC metachromatic leukodystrophy, viral infection and Krabbe's disease. 

XX 

SQ Sequence 1192 AA; 



Query Match 98.2%; Score 5815; DB 22; Length 1192; 

Best Local Similarity 97.2%; Pred. No. 5.6e-294; 

Matches 1160; Conservative 4; Mismatches 13; Indels 16; Gaps 3; 
Qy 1 MEDLDQSPLVSSSDSPPRPQPAFKYQFVREPEDEEEEEEEEEEDEDEDLEELEVLERKPA 60 

1 i > i i i i i l.m i i i i n i i i i i i i i i 

Db 1 MEDLDQSPM¥B£BDSPPRPQPAFKYQFVREPEDEEEEEEEEEEDEDEDLEELEVLE A 60 

QY 61 AGLSAAPVPTAPAAGAPLMDFGNDFVPPAPRGPLPAAPPVAPERQPSWDPSPVSSTVPAP 120 

I Ml MIIIIMIIIIIIIIIIIMII IIIIIM Mill MUM IMIIII IMII II 

Db 61 AGLSAAPVPTAPAAGAPLMDFGNDFVPPAPRGPLPAAPPVAPERQPSWDPSPVSSTVPAP 120 
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SPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPVWTPPAPAPAAPPSTPAAPKRRG 


180 


Db 
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u i ; mi i i h.u i i i i n i i h i i i u i 

SPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPVWTPPAPAPAAPPSTPAAPKRRG 
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SSGA WXXXXKIMDLKEQPGNTISAGQEDFPSVLLETAASXPSLSP 


226 


Db 
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Ml: h = IIIIIIIIIIIMIIIIIIIIMIMII Mill 

SSGSVDETLFALPAASEPVIRSSAENMDLKEQPGNTISAGQEDFPSVLLETAASLPSLSP 
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LSAASFKEHEYLGNLSTVLPTEGTLQENVSEASKEVSEKAKTLLIDRDLTEFSELEYSEM 
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> M 1 1 1,1 1 1 1 1 1 ! MM 1 1 M M 1 Mil 1 
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IIIIIIMIIIMIIIIIMIIIIIIIIIIIIIIMIIIM I I .1 I I M 



Db 



3 01 GSSFSVSPKAESAVIVANPREEIIVKNKDEEEKLVSNNILHNQQELPTALTKLVKEDEW 360 
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1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 II 1 1 1 1 1 I 1 1 1 1 1 1 1 1 1 1 1 1 1 


406 
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1 1 f 1 M 1 1 1 1 1 ! 1 1 1 1 1 1 1 f 1 1 1 1 1 1 1 | | | | | | | [ | | | | | | | | | | | [ I 1 1 1 1 1 1 1 1 1 1 1 1 

SSEKAKDSFNEKRVAVEAPMREEYADFKPFERVWEVKDSKEDSDMLAAGGKIESNLESKV 


420 


Qy 


407 


DKKCFADSLEQTNHEKDSESSNDDTSFPSTPEGIKDRSGAYITCAPFNPAATESIATNIF 


466 


Db 


421 


M 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 II 1 1 II 1 1 1 1 1 II II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 II 1 1 1 1 

DKKCFADSLEQTNHEKDSESSNDDTSFPSTPEGIKDRSGAYITCAPFNPAATESIATNIF 


480 


Qy 


467 


PLLEDPTSENXTDEKKIEEKKAQIVTEKNTSTKTSNPFFVAAQDSETDYVTTDNLTKVTE 
HI | | | | | | | | | || | | | | | | | | | | | | | | || || | | | | | | || | HI | | || || | || | || | 
PLLGD PTS ENKTDE KKI EEKKAQ I VTEKNTSTKTSN PFLVAAQDS ETD YVTTDNLTKVTE 


526 


Db 


481 


540 


Qy 


527 


EWANMPEGLTPDLVQEACESELNEVTGTKIAYETKMDLVQTSEVMQESLYPAAQLCPSF 
1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 II 1 1 II 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 


586 


Db 


541 


1 1 1 1 1 M 1 1 1 1 II 1 1 1 II 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 [ 1 1 1 1 1 M 1 1 1 1 1 1 1 1 | 1 | | | | j | 

EWANMPEGLTPDLVQEACESELNEVTGTKIAYETKMDLVQTSEVMQESLYPAAQLCPSF 


600 


Qy 


587 


EESEATPSPVLPDIVMEAPLNSAVPSAGASVIQPSSSPLEASSVNYESIKHEPENPPPYE 
1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 II 1 1 1 1 1 1 1 


646 


Db 


601 


1 1 1 1 1 l ll ll ll ll ll l l ll ll ll l ll | M M ) M II 1 1 II II II II II 1 M II II 1 II I 

EESEATPSPVLPDIVMEAPLNSAVPSAGASVIQPSSSPLEASSVNYESIKHEPENPPPYE 


660 


Qy 


647 


EAMSVSL-KVSGIKEEIKEPENINAALQETEAPYISIACDLIKETKLSAEPAPDFSDYSE 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 II II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

EAMSVSLKKVSGIKEEIKEPENINAALQETEAPYISIACDLIKETKLSAEPAPDFSDYSE 


705 


Db 


661 


720 


Qy 


706 


MAKVEQPVPDHSELVEDSSPDSEPVDLFSDDSIPDVPQKQDETVMLVKESLTETSFESMI 
1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II II II 1 1 1 1 1 1 1 1 


765 


Db 


721 


1 < 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 I 1 1 1 1 1 1 1 II 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 I 1 I I 1 1 1 1 1 1 1 1 1 1 1 

MAKVEQPVPDHSELVEDSSPDSEPVDLFSDDSI PDVPQKQDETVMLVKESLTETSFESMI 


780 


Qy 


766 


EYENKESDSA5E0EGGKPYLESFKLSLDNTKDTLLPDEVSTLSKKEKIPLQMEELSTA 
1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 




Db 


781 


1 1 1 1 i ■ ■ ■ ■ i i i i i i i i i i i i i i i i t i i i i i i i i i i i i i i i | | | | | i | | | | | | | | | | | | | 

EYENKEKLSALPPEGGKPYLESFKLSLDNTKDTLLPDEVSTLSKKEKIPLQMEELSTAW 


840 


Qy 


826 


SNDDLFI SKEAQI RETETFSDSSPI EI I DEFPTLI SSKTDSFSKLAREYTDLEVSHKSEI 
1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 II 1 


885 


Db 


841 


i i < i i i i i i i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 | | | | | | | | I | M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 II 1 

SNDDLFI SKEAQI RETETFSDSSPI EI I DEFPTLI SSKTDSFSKLAREYTDLEVSHKSEI 


900 


Qy 


886 


ANAPDGAGSLPCTELPHDLSLKNI QPKVEEKI SFSDDFSKNGSATSKVLLLPPDVSALGH 


945 


Db 


901 


1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 II 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 II 1 1 1 1 1 1 

ANAPDGAGSLPCTELPHDLSLKN I QPKVEEKI SFSDDFSKNGSATS KVLLLPPDVSALA- 


959 


Qy 


946 


TQAEI ESI VKPKVLEKEAEKKLPSDTEKEDRSPSAI FSADLGKTSWDLLYWRDI KKTGV 
II 1 1 1 1 1 1 1 1 1 1 1 1 1 II II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II • 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 


1005 


Db 


960 


1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 I I I I I I | | | | - I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

TQAEIESI VKPKVLVKEAEKKLPSDTEKEDRSPSAIFSAELSKTSWDLLYWRDIKKTGV 


1019 


Qy 


1006 


VFGASLFLLLSLTVFSIVSVTAYIALALLSVT SFRI YKGVI QAI QKSDEGHPFRAYLES 

llllltlllllllllllllliiiiiiiiiiiiiiiiiiiiiiiiiiiiitiiiiiiiiii 


1065 


Db 


1020 


1 M 1 M 1 1 1 1 1 1 1 1 1 1 1 II II 1 II 1 I I I I | | | | | | | | | | | || | | | | | | | || | | | | | | | | | 
VFGASLFLLLSLTVFS I VSVTAYIALALLSVTI SFRI YKGVI QAI QKSDEGHPFRAYLES 


1079 


Qy 


1066 


EVAI SEELVQKYSNSALGHVNCTI KELRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGLT 


1125 


Db 


1080 


- I.N 1 i U 1 II 1 U 1 1 M 1 1 1 I Ml | 

EVAI SEELVQKYSNSALGHVNCTI KELRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGLT 


1139 


Qy 


1126 


LLI LALI SLFS VPVI YERHQAQI DHYLGLANKNVKDAMAKI QAKI PGLKRKAE 1178 




Db 


1140 


MIIMIIMMIIIIIIIIIIIIIMIIMMIMIIIIIIIIIIIIIIIM 

LL I LAL I SLFS VPVI YERHQAQ I DHYLGLANKNVKDAMAKI QAKI PGLKRKAE 1192 





RESULT 3 
ABP68600 

ID ABP68600 standard; Protein; 1192 AA . 
XX 

AC ABP68600; 
XX 

DT 14-JAN-2003 (first entry) 
XX 

DE Human pancreatic cancer expressed protein SEQ ID NO 71. 
XX 

KW Human; pancreas; cancer; gene therapy; vaccine; immuno stimulant ; 

KW cytostatic; tumour. 

XX 

OS Homo sapiens . 
XX 

PN WO200260317-A2 . 
XX 

PD 08-AUG-2002. 
XX 

PF 30-JAN-2002; 2 002WO-US0278 1 . 
XX 

PR 30-JAN-2001; 2 0 0 1US-2653 05P . 

PR 31-JAN-2001; 2001US-265682P . 

PR 09-FEB-2001; 2001US-267568P . 

PR 21-MAR-2001; 2 001US-278651P . 

PR 28-APR-2001; 2001US-287112P . 

PR 16-MAY-2001; 2 001US-2 9 163 IP . 

PR 12-JUL-2001; 2001US-305484P. 

PR 20-AUG-2001; 2001US-313999P . 

PR 27-NOV-2001; 2 001US-333 62 6P . 
XX 

PA (CORI-) CORIXA CORP. 
XX 

PI Benson DR, Kalos MD, Lodes MJ # Persing DH, Hepler WT, Jiang Y; 
XX 

DR WPI; 2002-627435/67. 

DR N-PSDB; ABV94680. 
XX 

PT New isolated polynucleotide and pancreatic tumor polypeptides, useful 

PT for diagnosing, preventing and/or treating cancer, particularly 

PT pancreatic cancer 
XX 

PS Claim 2; SEQ ID NO 71; 300pp + Sequence Listing; English. 
XX 

CC The invention relates to an isolated polynucleotide (I) comprising: (a) 

CC any of a group of over 4000 nucleotide sequences (ABV94 62 8 -ABV9914 5) ; 

CC (b) complements of (a) ; (c) sequences consisting of at least 20 

CC contiguous residues of (a); (d) sequences that hybridize to (a), under 

CC moderately stringent conditions; (e) sequences having at least 75% or 9 

CC identity to (a); or (f) degenerate variants of (a). Polypeptides 

CC (ABP68596-ABP68637) encoded by (I) and oligonucleotide can be used to 

CC detect cancer in a patient and compositions comprising polypeptides, 

CC polynucleotides, antibodies, fusion proteins, T cell populations and 

CC antigen presenting cells expressing the polypeptide are useful in 

CC treating pancreatic cancer and stimulating an immune response. The 



CC polynucleotides can be used as probes or primers for nucleic acid 

CC hybridisation, in the design and preparation of ribozyme molecules for 

CC inhibiting expression of the tumour polypeptides and proteins in the 

CC tumour cells, in vaccines and for gene therapy. 

CC Note: The sequence data for this patent did not form part of the printed 

CC specification, but was obtained in electronic format directly from WIPO 

CC at ftp.wipo.int/pub/published_pct_sequences. 
XX 

SQ Sequence 1192 AA; 

Query Match 98.2%; Score 5815; DB 23; Length 1192; 

Best Local Similarity 97.2%; Pred. No. 5.6e-294; 

Matches 1160; Conservative 4; Mismatches 13; Indels 16; Gaps 3; 
Qy 1 MEDLDQSPLVSSSDSPPRPQPAFKYQFVREPEDEEEEEEEEEEDEDEDLEELEVLERKPA 60 

1 1 1 1 1 M 1 1 1 ' M M I : II IM 1 1 I II I II MM I I II III Ml 1 1 1 II 1 1 1 1 1 

Db 1 MEDLDQSPLVSSSDSPPRPQPAFKYQFVREPEDEEEEEEEEEEDEDEDLEELEVLERKPA 60 

Qy 61 AGLSAAPVPTAPAAGAPLMDFGNDFVPPAPRGPLPAAPPVAPERQPSWDPSPVSSTVPAP 120 

u i i : i I u I i i i i I i i i in u I i. 

Db 61 AGLSAAPVPTAPAAGAPLMDFGNDFVPPAPRGPLPAAPPVAPERQPSWDPSPVSSTVPAP 12 0 

Qy 121 SPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPVWTPPAPAPAAPPSTPAAPKRRG 180 

IIIIIMIIIIIIIMIIMIIIMIIIIIIIIIIIIIIMMIIIMIIIMMIMM 

Db 121 SPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPWTPPAPAPAAPPSTPAAPKRRG 180 

Qy 181 SSGA WXXXXKIMDLKEQPGNTISAGQEDFPSVLLETAASXPSLSP 226 

MM |: : I II I I I IMM I I I I I I I I I II HIM 

Db 181 SSGSVDETLFALPAASEPVIRSSAENMDLKEQPGNTISAGQEDFPSVLLETAASLPSLSP 240 

Qy 227 LSAASFKEHEYLGNLSTVLPTEGTLQENVSEASKEVSEKAKTLLIDRDLTEFSELEYSEM 286 

Db 241 LSAASFKEHEYLGNLSTVLPTEGTLQENVSEASKEVSEKAKTLLIDRDLTEFSELEYSEM 3 00 

Qy 287 GSSFS VS PKAESAVI VANPREE 1 1 VKNKDEEEKLVSNN I LHXQQELPTALTKLVKEDE W 346 

IIMIIIIIIIMIIIIMIIIIIIIIMIIIIMIIIIM 

Db 301 GSSFSVSPKAESAVIVANPREEIIVKNKDEEEKLVSNNILHNQQELPTALTKLVKEDEW 360 

Qy 347 SSEKAKDSFNEKRVAVEAPMREEYADFKPFERVWEVKDSKEDSDMLAAGGKIESNLESKV 4 06 

MMMMIMM MUM Mill Ml Mill I Mill IIMII IMIMM II Mill 

Db 361 SSEKAKDSFNEKRVAVEAPMREEYADFKPFERVWEVKDSKEDSDMLAAGGKIESNLESKV 420 

QY 407 DKKCFADSLEQTNHEKDSESSNDDTSFPSTPEGIKDRSGAYITCAPPNPAATESIATNIF 466 

MMMMMIII IMIM M IIMII Mill I Mill IMMIMIMIM II Mill 

Db 421 DKKCFADSLEQTNHEKDSESSNDDTSFPSTPEGIKDRSGAYITCAPFNPAATESIATNIF 480 

Qy 467 PLLEDPTSENXTDEKKIEEKKAQIVTEKNTSTKTSNPFFVAAQDSETDYVTTDNLTKVTE 526 

Ml MUM MMMMMIMMMMMMMM MMMMMMMMMMI 

Db 481 PLLGDPTSENKTDEKKI EEKKAQI V^EKNTSTKTSNPFLVAAQDSETDYVTTDNLTKVTE 54 0 

Qy 527 EWANMPEGLTPDLVQEACESELNEVTGTKIAYETKMDLVQTSEVMQESLYPAAQLCPSF 586 

IIIIIMMIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIMMIIIIIIIIIMIII 

Db 541 EWANMPEGLTPDLVQEACESELNEVTGTKIAYETKMDLVQTSEVMQESLYPAAQLCPSF 600 

Qy 587 EESEATPSPVLPDIVMEAPLNSAVPSAGASVIQPSSSPLEASSVNYESIKHEPENPPPYE 646 

Ml M 1 1 1 II 1 1 MM II M 1 1 . M IM II I M II 1 1 1 Ml I II 1 1 Ml M 1 1 i I 

Db 601 EESEATPSPVXPDIVMEAPLNSAVPSAGASVIQPSSSPLEASSVNYESIKHEPENPPPYE 660 



Qy 


647 


EAMSVSL - KVSG I KEE I KE PEN I NAALQETEA P Y I S IACDLI KETKLSAEPAPDFSDYSE 


705 


Db 


661 


1 MlllllllllllllilllllllllllllMIIIIIIIIIIIIIIIIIIIII 

EAMSVSLKKVSGI KEE I KEPENINAALQETEAPYIS IACDLI KETKLSAEPAPDFSDYSE 


720 


Qy 


706 


MAKVEQPVPDHSELVEDSSPDSEPVDLFSDDS I PDVPQKQDETVMLVKESLTETSFESMI 


765 


Db 


721 


llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 

MAKVEQPVPDHSELVEDSSPDSEPVDLFSDDS I PDVPQKQDETVMLVKESLTETSFESMI 


780 


Qy 


766 


EYENKEKLSALPPEGGKPYLESFKLSLDNTKDTLLPDEVSTLSKKEKI PLQMEELSTAVY 


825 


Db 


781 


IIMIIMIMIIIMIMIIIIIIIIIIMIIIIMIMIIIIMIMIIMIIIIIII 

EYENKEKLSALPPEGGKPYLESFKLSLDNTKDTLLPDEVSTLSKKEKI PLQMEELSTAVY 


840 


Qy 


826 


SNDDLFISKEAQIRETETFSDSSPIEI IDEFPTLISSKTDSFSKLAREYTDLEVSHKSEI 


885 


Db 


841 


IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIMIMIIMIIIIII 

SNDDLFISKEAQIRETETFSDSSPIEI IDEFPTLISSKTDSFSKLAREYTDLEVSHKSEI 


900 


Qy 


886 


ANAPDGAGSLPCTELPHDLSLKNIQPKVEEKISFSDDFSKNGSATSKVLLLPPDVSALGH 


945 


Db 


901 


MMMMMMMMMMMMMMMMMMIMMMMMMMMMIM 

ANAPDGAGSLPCTELPHDLSLKNIQPKVEEKISFSDDFSKNGSATSKVLLLPPDVSALA- 


959 


Qy 


946 


TQAEIESIVKPKVLEKEAEKKLPSDTEKEDRSPSAIFSADLGKTSWDLLYWRDIKKTGV 


1005 


Db 


960 


Illlllllllllll IIIIIIIIIIIIMMIIIIIIIhl 1 1 ! 1 1 , 1 1 1 1 1 1 1 1 1 

TQAE I ESI VKPKVLVKEAEKKLPSDTEKEDRSPSAI FSAELSKTSWDLLYWRDI KKTGV 


1019 


Qy 


1006 


VFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRAYLES 


1065 


UJJ 


i n o n 
lUzU 


M 1 1 II II 1 M 1 1 1 1 1 II II II M II 1 M 1 M M 1 M II II II II II II II II 1 II 1 M 1 

VFGASLFLLLSLTVFS I VSVTAYIALALLSVTISFRI YKGVI QAIQKSDEGHPFRAYLES 


1079 


Qy 


1066 


EVAI SEELVQKYSNSALGHVNCTI KELRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGLT 


1125 


Db 


1080 


MM III Mill MIMI II Mill Mill II Mill II Ml Mill II MM II Mill 

EVAI SEELVQKYSNSALGHVNCTI KELRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGLT 


1139 


Qy 


1126 


LLI LALI SLFS VPVI YERHQAQI DHYLGLANKNVKDAMAKI QAKI PGLKRKAE 1178 




Db 


1140 


1 1 1 1 M II II II II II II M II II 1 1 1 1 II 1 1 II M 1 II II II M II II II II 

LLI LALI SLFS VPVI YERHQAQI DHYLGLANKNVKDAMAKI QAKI PGLKRKAE 1192 





RESULT 4 
AAY56967 

ID AAY56967 standard; Protein; 1192 AA. 
XX 

AC AAY56967; 
XX 

DT 25-APR-2000 (first entry) 
XX 

DE Human MAGI polypeptide. 
XX 

KW MAGI protein; neuroendocrine-specif ic protein; neuropathy; human; 

KW spinal injury; neuronal degeneration; neuromuscular disorder; cancer; 

KW psychiatric disorder; developmental disorder; inflammatory disorder; 

KW stroke; cytostatic; cerebroprotect ive ; neuroprotective. 

XX 

OS Homo sapiens. 
XX 

PN WO200005364-A1 . 



PD 03-FEB-2000. 
XX 

PF 21-JUL-1999; 9 9WO-GB023 60 . 
XX 

PR 22-JUL-1998; 98GB-0016024 . 

PR 19-JUL-1999; 99GB-0016898 . 
XX 

PA (SMIK ) SMITHKLINE BEECHAM PLC . 
XX 

PI Michalovich D, Prinjha RK; 
XX 

DR WPI; 2000-182693/16. 

DR N-PSDB; AAZ56886. 
XX 

PT Novel polypeptides related to neuroendocrine-specif ic proteins and 

PT polynucleotides useful for diagnosis of various diseases and for 

PT treatment of cancer and neurological disorders - 
XX 

PS Claim 2; Page 2 0-21; 35pp; English. 
XX 

CC The invention relates to human MAGI protein, which is similar to 

CC neuroendocrine-specif ic protein. The MAGI protein can be expressed by 

CC standard recombinant methodology. The MAGI polypeptides, polynucleotides 

CC and antibodies are useful for treating diseases, including neuropathies, 

CC spinal injury, neuronal degeneration, neuromuscular disorders, 

CC psychiatric disorders and developmental disorders, cancer, stroke and 

CC inflammatory disorders. The polynucleoitde is also useful for chromosome 

CC localization and for tissue expression studies. The present sequence 

CC represents the human MAGI protein. 

XX 

SQ Sequence 1192 AA; 



Query Match 98.1%; Score 5810; DB 21; Length 1192; 

Best Local Similarity 97.2%; Pred. No. le-293; 

Matches 1159; Conservative 4; Mismatches 14; Indels 16; Gaps 3; 



Qy 


l 


MEDLDQSPLVSSSDSPPRPQPAFKYQFVREPEDEEEEEEEEEEDEDEDLEELEVLERKPA 


60 


Db 


l 


MEDLDQSPLVSSSDSPPRPQPAFKYQFVREPEDEEEEEEEEEEDEDEDLEELEVLERKPA 


60 


Qy 


61 


AGLSAAPVPTAPAAGAPLMDFGNDFVPPAPRGPLPAAPPVAPERQPSWDPSPVSSTVPAP 


120 


Db 


61 


i u , i i i u 1 i : i m i i ri m i ri u 

AGLSAAPVPTAPAAGAPLMDFGNDFVPPAPRGPLPAAPPVAPERQPSWDPSPVSSTVPAP 


120 


Qy 


121 


SPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPVWTPPAPAPAAPPSTPAAPKRRG 


180 


Db 


121 


IMIMIMII IIIIIMIIIMIMIIIIIIIIIIII 1 llllllilllll Ml MM! 

SPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPVWTPPAPAPAAPPSTPAAPKRRG 


180 


Qy 


181 


SSGA WXXXXKIMDLKEQPGNTISAGQEDFPSVLLETAASXPSLSP 


226 


Db 


181 


IN: h : ,IM 1 1 III 1 1 1 1 . Ml 1 1 1 M II 1 1 Mill 

SSGSVDETLFALPAASEPVIRSSAENMDLKEQPGNTISAGQEDFPSVLLETAASLPSLSP 


240 


Qy 


227 


LSAASFKEHEYLGNLSTVLPTEGTLQENVSEASKEVSEKAKTLLIDRDLTEFSELEYSEM 


286 


Db 


241 


MIMIMIIIIIIIIIIIIIIIIIMIIMMIIIIIIMIIIIIIIIIIIIIIIMII 

LSAASFKEHEYLGNLSTVLPTEGTLQENVSEASKEVSEKAKTLLIDRDLTEFSELEYSEM 


300 


Qy 


287 


GSS FS VS PKAESAVT VANPREE 1 1 VKNKDEEEKLVSNN I LHXQQELPTALTKLVKEDE W 


346 



IIMMIMMMMIMIIIIIIIIMIIIIMIIIIMI IMIIIIMIIIIIIIII 

Db 301 GSSFSVSPKAESAVIVANPREEI I VKNKDEEEKLVSNNILHNQQELPTALTKLVKEDEW 360 

QY 347 SSEKAKDSFNEKRVAVEAPMREEYADFKPFERWEVKDSKEDSDMLAAGGKIESNLESKV 406 

IMIIIIIIIIIIIIMIIIIIIMIIIIIIIIIMIIIIIIIIIIMMIIIIIIIIII 

Db 361 SSEKAKDSFNEKRVAVEAPMREEYADFKPFERVWEVKDSKEDSDMLAAGGKIESNLESKV 42 0 

QY 407 DKKCFADSLEQTNHEKDSESSNDDTSFPSTPEGIKDRSGAYITCAPFNPAATESIATNIF 466 

Miiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii 1 1 1 1 1 1 1 1 1 1 f 1 1 1 1 1 r 1 1 i 1 1 

Db 421 DKKCFADSLEQTNHEKDSESSNDDTSFPSTPEGIKDRPGAYITCAPFNPAATESIATNIF 480 

QY 467 PLLEDPTSENXTDEKKI EEKKAQI VTEKNTSTKTSNPFFVAAQDSETDYVTTDNLTKVTE 526 

Ml MMM IIIIIIIMIIMIIIIIIIMIIIM IIMMIIII! Illlll 

Db 481 PLLGDPTSENKTDEKKIEEKKAQIVTEKNTSTKTSNPFLVAAQDSETDYVTTDNLTKVTE 54 0 

Qy 52 7 EWANMP TPDLVQEACESELNEVTGTKIAYETKMDLVQTSEVMQESLYPAAQLCPSF 586 

MIMIIIMMIIIIIIIIIIIMIIMIIIMIIMIIIIIMIIIIIIIIIIMIII 

Db 541 EWANMPEGLTPDLVQEACESELNEVTGTKIAYETKMDLVQTSEVMQESLYPAAQLCPSF 600 

Qy 587 EESEATPSPVLPDIVMEAPLNSAVPSAGASVIQPSSSPLEASSVNYESIKHEPENPPPYE 646 

: - ^ ' - . I 

Db 601 EESEATPSPVLPDIVMEAPLNSAVPSAGASVIQPSSSPLEASSVNYESIKHEPENPPPYE 660 

Qy 647 EAMSVSL-KVSGIKEEIKEPENINAALQETEAPYISIACDLIKETKLSAEPAPDFSDYSE 705 

M IN IIIM'II MINI. Illlll Illlll MINI IIMMhlllli 

Db 661 EAMSVSLKKVSGIKEEIKEPENINAALQETEAPYISIACDLIKETKLSAEPAPDFSDYSE 720 

Qy 706 MAKVEQPVPDHSELVEDSSPDSEPVDLFSDDSIPDVPQKQDETVMLVKESLTETSFESMI 765 

I i M ,! i II I I I I I I M || | i |'| I M 

Db 721 MAKVEQPVPDHSELVEDSSPDSEPVDLFSDDSIPDVPQKQDETVMLVKESLTETSFESMI 780 

Qy 766 EYENKEKLSALPPEGGKPYLESFKLSLDNTKDTLLPDEVSTLSKKEKI PLQMEELSTAVY 825 

MIMMIIIIIIIIIMIIIIIIIIIIIIIIIIMIMIIIIMIIIIIIIIIIIMII 

Db 781 EYENKEKLSALPPEGGKPYLESFKLSLDNTKDTLLPDEVSTLSKKEKI PLQMEELSTAVY 84 0 

QY 826 SNDDLFISKEAQIRETETFSDSSPIEI IDEFPTLISSKTDSFSKLAREYTDLEVSHKSEI 885 

IIIIMIIIIIIMMMIIIIMMIIIIIIIIIMMIMIIIIIMIIIIIMMII 

Db 841 SNDDLFI SKEAQI RETETFSDSSPI EI IDEFPTLI SSKTDSFSKLAREYTDLEVSHKSE I 900 

Qy 886 ANAPDGAGSLPCTELPHDLSLKNIQPKVEEKISFSDDFSKNGSATSKVLLLPPDVSALGH 945 

MMMIIMIMIIIIIIMIIIIIMMIIIIIIIIIIIIIIIIMIIIIMMII 

Db 901 ANAPDGAGSLPCTELPHDLSLKNIQPKVEEKISFSDDFSKNGSATSKVLLLPPDVSALA- 959 

Qy 946 TQAE I ES I VKPKVLEKEAEKKLPSDTEKEDRS PSA I FSADLGKTS WDLLYWRD I KKTGV 1005 

MMiiiiiiiiii m 1 1 1 f 1 1 1 1 1 1 f i [ r i r 1 1 1 1 r i = i iiiiiiiiiiiimiiii 

Db 960 TQAEIESIVKPKVLVKEAEKKLPSDTEKEDRSPSAIFSAE SWDLLYWRDIKKTGV 1019 

Qy 1006 VFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRAYLES 1065 

f r 1 1 r 1 1 1 j j 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ( r i r 1 1 1 1 f 1 1 j 1 1 j 1 1 1 1 1 1 f 1 1 r 

Db 1020 VFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRI YKGVIQAIQKSDEGHPFRAYLES 1079 

Qy 1066 EVAI SEELVQKYSNSALGHVNCTI KELRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGLT 1125 

IMIIIMIIIIIIIIIIIIMIIIIIIMMIIIIIMIIIMIIIIIIIIIMIIIII 

Db 108 0 EVAI SEELVQKYSNSALGHVNCTI KELRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGLT 113 9 

Qy 1126 LLI LALI SLFSVPVI YERHQAQI DHYLGLANKNVKDAMAKI QAKI PGLKRKAE 1178 

IIMIMIIIIIIMIIIIIIIIIIMMIIIIIIIIIIMIIIMIMIIM 



Db 



1140 LLI LALI SLFSVPVT YERHQAQI DHYLGLANKNVKDAMAKI QAKI PGLKRKAE 1192 



RESULT 5 
AAB82349 

ID AAB82349 standard; Protein; 1192 AA. 
XX 

AC AAB82349; 
XX 

DT 23-JUL-2001 (first entry) 
XX 

DE Human NOGO-A protein. 
XX 

KW NOGO-A; human; chromosome 2p21; neuropathy; spinal injury; 

KW brain injury; stroke; neuronal degeneration; Alzheimer's disease; 

KW Parkinson's disease; neuromuscular disorder; psychiatric disorder; 

KW developmental disorder; neuroprotective; nootropic; neuroleptic; 

KW antiparkinsonian; cerebroprotective; neuroleptic; diagnosis; 

KW therapy. 

XX 

OS Homo sapiens. 
XX 

PN WO200136631-A1. 
XX 

PD 25-MAY-2001. 
XX 

PF 14-NOV-2000; 2 000WO-GB04345 . 
XX 

PR 15-NOV-1999; 99GB-0026995 . 

PR 24-JAN-2000; 2000GB-0001550 . 
XX 

PA (SMIK ) SMITHKLINE BEECHAM PLC. 
XX 

PI Michalovich D, Prinjha R; 
XX 

DR WPI; 2001-343822/36. 

DR N-PSDB; AAF90324 . 
XX 

PT New polypeptide designated NOGO-C is a splice variant of the human NOGO 

PT gene and may be useful in the treatment of neural disorders including 

PT Alzheimer's and Parkinson's diseases 
XX 

PS Disclosure; Page 26-27; 25pp ; English. 
XX 

CC The present sequence is that of human NOGO-A. NOGO-A is a 

CC previously known splice variant of the human NOGO gene on chromosome 

CC 2p21. The invention relates to a novel splice variant, NOGO-C (see 

CC TSfiB82Ba8Al2hetmpr6HideseH©©Oafiedp©iy|iffipBcbries drndepefcynneiheotamdeaiaaxid 

CC methods for producing such polypeptides by recombinant techniques. 

CC Also disclosed are methods for utilising NOGO-C polypeptides and 

CC polynucleotides in the treatment of diseases including neuropathies, 

CC spinal injury, brain injury, stroke, neuronal degeneration, for 

CC disorders, psychiatric disorders and developmental disorders. Also 

CC provided are methods for identifying agonists and agonists for 

CC use in treating conditions associated with NOGO-C imbalance, and 

CC diagnostic assays for detecting diseases associated with 



CC inappropriate NOGO-C activity or levels. 
XX 

SQ Sequence 1192 AA; 

Query Match 98.1%; Score 5810; DB 22; Length 1192; 

Best Local Similarity 97.2%; Pred. No. le-293; 

Matches 1159; Conservative 4; Mismatches 14; Indels 16; Gaps 3; 
QY 1 MEDLDQSPLVSSSDSPPRPQPAFKYQFVREPEDEEEEEEEEEEDEDEDLEELEVLERKPA 60 

MIIIIIIIMIMIIIIIIIIIIIIIIIMIIMMIMIIIIIIMIIIIMIIIIII 

Db 1 MEDLDQSPLVSSSDSPPRPQPAFKYQFVREPEDEEEEEEEEEEDEDEDLEELEVLERKPA 60 

Qy 61 AGLSAAPVPTAPAAGAPLMDFGNDFVPPAPRGPLPAAPPVAPERQPSWDPSPVSSTVPAP 120 

Db 61 AGLSAAPVPTAPAAGAPLMDFGNDFVPPAPRGPLPAAPPVAPERQPSWDPSPVSSTVPAP 12 0 

Qy 121 SPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPVWTPPAPAPAAPPSTPAAPKRRG 18 0 

MIIIIIIIMIIIIIIII MINIM III Mill II IIIIIMIIWIillt 

Db 121 SPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPVWTPPAPAPAAPPSTPAAPKRRG 180 

Qy 181 SSGA WXXXXKI MDLKEQPGNT I SAGQEDF PS VLLETAASXPSLS P 226 

MM h : I I I I I I I I I I I I I I I I I I I | 1 | | | | | | | MIM 

Db 181 SSGSVDETLFALPAASEPVIRSSAENMDLKEQPGNTISAGQEDFPSVLLETAASLPSLSP 240 

Qy 227 LSAAS FKEHE YLGNLS T VL PTEGTLQENVS EAS KE VS E KAKTLL I DRDLTEFS ELE YS EM 286 

1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ! 1 1! 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 [ 1 1 1 

Db 241 LSAASFKEHEYLGNLSTVLPTEGTLQENVSEASKEVSEKAKTLLIDRDLTEFSELEYSEM 3 00 

Qy 287 GSSFS VS PKAESAVI VANPREEI I VKNKDEEEKLVSNNI LHXQQELPTALTKLVKEDEW 346 

I ( I I I I ! ! I I I I I I I I ! IIIIIIIIIIIIMMII 

Db 301 GSSFS VS PKAESAVI VANPREE 1 1 VKNKDEEEKLVSNN I LHNQQELPTALTKLVKEDE W 360 

Qy 347 SSEKAKDSFNEKRVAVEAPMREEYADFKPFERVWEVKDSKEDSDMLAAGGKIESNLESKV 406 

| n i i i i i i , i i ii .| m i ■ i i i i 

Db 361 SSEKAKDSFNEKRVAVEAPMREEYADFKPFERVWEVKDSKEDSDMLAAGGKI ESNLESKV 42 0 

QY 407 DKKCFADSLEQTNHEKDSESSNDDTSFPSTPEGIKDRSGAYITCAPFNPAATESIATNIF 466 

1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 M i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M MIMIMMIIMMIIMM 

Db 421 DKKCFADSLEQTNHEKDSESSNDDTSFPSTPEGIKDRPGAYITCAPFNPAATESIATNIF 480 

QY 467 PLLEDPTSENXTDEKKIEEKKAQIVTEKNTSTKTSNPFFVAAQDSETDYVTTDNLTKVTE 526 

Ml I M 1 1 1 IIIMI! I I , I : | I h! i 

Db 481 PLLGDPTSENKTDEKKIEEKKAQI VTEKNTSTKTSNPFLVAAQDSETDYVTTDNLTKVTE 54 0 

Qy 527 EWANMPEGLTPDLVQEACESELNEVTGTKIAYETKMDLVQTSEVMQESLYPAAQLCPSF 58 6 

M 1 1 1 1 1 1 1 ] 1 1 1 1 1 1 1 M 1 1 1 1 1 M 1 1 1 1 1 1! M 1 1 1 M M I ! I If II I M 1 1 r I M M 

Db 541 EWANMPEGLTPDLVQEACESELNEVTGTKIAYETKMDLVQTSEVMQESLYPAAQLCPSF 600 

Qy 587 EESEATPSPVLPDIVMEAPLNSAVPSAGASVIQPSSSPLEASSVNYESIKHEPENPPPYE 646 

MIMIMIIMIIMIIMIIIIIIIMIMIIMIIIIIIIIMIIIIIIMMIIII 

Db 601 EESEATPSPVLPDIVMEAPLNSAVPSAGASVIQPSSSPLEASSVNYESIKHEPENPPPYE 660 

Qy 647 EAMSVSL -KVSG I KEE I KEPENI NAALQETEAP YI S I ACDLI KETKLSAEPAPDFSDYSE 705 

cc I M 1 1 1 1 1 1 II 1 1 1 1 II I M 1 1 1 1 1 1 1 II M M 1 1 1 1 1 1 II I M 1 1 1 II 1 1 II 1 1 1 1 M 

Db 661 EAMSVSLKKVSGIKEEIKEPENINAALQETEAPYISIACDLI KETKLSAEPAPDFSDYSE 720 



Qy 



706 MAKVEQPVPDHSELVEDSSPDSEPVDLFSDDSIPDVPQKQDETVMLVKESLTETSFESMI 765 



/ 



IMMIMII IIIIMMIIIIII MINI lilllllillMIIIIIIIIIMMIIM 

MAKVEQPVPDHSELVEDSSPDSEPVDLFSDDSIPDVPQKQDETVMLVKESLTETSFESMI 780 



Db 


721 


MAKVEQPVPDHSELVEDSSPDSEPVDLFSDDSIPDVPQKQDETVMLVKESLTETSFESMI 


780 


Qy 


766 


EYENKEKLSALPPEGGKPYLESFKLSLDNTKDTLLPDEVSTLSKKEKIPLQMEELSTAVY 


825 


Db 


781 


MMMMMMMIMM IIIIIIIIMIIMIII Illllllll MIMMM 

EYENKEKLSALPPEGGKPYLESFKLSLDNTKDTLLPDEVSTLSKKEKI PLQMEELSTAVY 


840 


Qy 


826 


SNDDLFI SKEAQI RETETFSDSS P I EI IDEFPTLI SSKTDSFSKLAREYTDLEVSHKSEI 


885 


Db 


841 


M 1 1 1 1 1 1 MM 1 1 1 III Ml II 1 II 1 Ml 1 II 1 II , M II 1 Ml 1 II 1 II III. 

SNDDLFI SKEAQI RETETFSDSSPI EI IDEFPTLI SSKTDSFSKLAREYTDLEVSHKSEI 


900 


Qy 


886 


ANAPDGAGSLPCTELPHDLSLKNI QPKVEEKI SFSDDFSKNGSATSKVLLLPPDVSALGH 


945 


Db 


901 


MMMMMIMMMMM MMIIMI MMMMMMIMM MUM 

ANAPDGAGSLPCTELPHDLSLKNI QPKVEEKI SFSDDFSKNGSATSKVLLLPPDVSALA- 


959 


Qy 


946 


TQAEIESIVKPKVLEKEAEKKLPSDTEKEDRSPSAIFSADLGKTSWDLLYWRDI KKTGV 


1005 


Db 


960 


IIIIMMIIIIII IIIIIMMMIIIIIIIIMIIhl 1 M M 1 II 1 1 1 1 1 1 M 1 1 

TQAEI ES I VKPKVLVKEAEKKLPSDTEKEDRSPSAI FSAELSKTSWDLLYWRDI KKTGV 


1019 


Qy 


1006 


VFGASLFLLLSLTVFSI VSVTAYIALALLSVTISFRI YKGVIQAIQKSDEGHPFRAYLES 


1065 




102 0 


Ml i 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

VFGASLFLLLSLTVFSI VSVTAYIALALLSVTISFRI YKGVIQAIQKSDEGHPFRAYLES 


1079 


Qy 


1066 


EVAI SEELVQKYSNSALGHVNCTI KELRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGLT 


1125 


Db 


1080 


M M MM 1 1 1 M 1 1 1 M MM M || I 1 1 1 1| 1 1 1 1 1 1 1 1 MUM II 1 IMIIII 1 

EVAI SEELVQKYSNSALGHVNCTI KELRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGLT 


1139 


Qy 


1126 


LL I LALI SLFS VPVI YERHQAQI DHYLGLANKNVKDAMAKI QAKI PGLKRKAE 1178 




Db 


1140 


1 1 1 1 1 1 1 1 If 1 1 1 1 i 1 1! 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 [ 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

LL I LALI SLFS VP VI YERHQAQI DHYLGLANKNVKDAMAKI QAKI PGLKRKAE 1192 





RESULT 6 
ABB81078 

ID ABB81078 standard; Protein; 1192 AA . 
XX 

AC ABB81078; 
XX 

DT 05-NOV-2002 (first entry) 
XX 

DE Human neurotransmitter receptor protein Nogo-A. 
XX 

KW Nerve regeneration; neuroprotection; neuronal degeneration; CNS; PNS; 
KW central nervous system; peripheral nervous system; tranquillizer; Nogo; 
KW vulnerary; cerebroprotective; anti-tumour; antidiabetic; anticonvulsant; 
KW nootropic; antiparkinsonian; ophthalmological ; analgesic; hepatotropic; 
KW osteopathic; vasotropic; nephrotropic ; cytostatic; antigen; gene therapy; 
KW neurotransmitter receptor; human; receptor. 
XX 

OS Homo sapiens. 
XX 

PN US2002072493-A1 . 
XX 

PD 13-JUN-2002. 
XX 

PF 28-JUN-2001; 2001US-0893348 . 
XX 



PR 19-MAY-1998; 98 IL- 0124500 . 

PR 21-JUL-1998; 98WO-US14715 . 

PR 22-DEC-1998; 98US- 0218277 . 

PR 19-MAY-1999; 99US-0314161 . 
XX 

PA { YEDA ) YEDA RES & DEV CO LTD. 
XX 

PI Eisenbach- Schwartz M, Hauben E # Cohen IR, Beserman P, Mosonego A; 

PI Moalem G; 

XX 

DR WPI; 2002-607255/65. 

DR N-PSDB; ABN86601. 
XX 

PT Promoting nerve regeneration and preventing neuronal degeneration in 

PT the central/peripheral nervous system from injury/disease, comprises 

PT administering nervous system-specific activated T cells/antigen, or 

PT analogs/peptides 
XX 

PS Examples; Page 53-56; 93pp; English. 
XX 

CC The invention relates to promoting nerve regeneration or conferring 

CC neuroprotection and preventing or inhibiting neuronal degeneration in the 

CC central /peripheral nervous system (NS) . The method involves administering 

CC NS-specific activated T cells, NS-specific antigen, its analogue or its 

CC peptide, a nucleotide sequence the NS-specific antigen or its analogue or 

CC combinations. The method is useful for promoting nerve regeneration and 

CC preventing neuronal degeneration in central /peripheral nervous system 

CC from injury/disease, where the injury is spinal cord injury, blunt 

CC trauma, penetrating trauma, hemorrhagic stroke, ischaemic stroke or 

CC damages caused by surgery such as tumour excision. The disease is not an 

CC autoimmune disease or neoplasm. The disease results in a degenerative 

CC process occurring in either gray or white matter or both. The disease 

CC is diabetic neuropathy, senile dementia, Alzheimer's disease, Parkinson's 

CC disease, facial nerve (Bell's) palsy, glaucoma, Huntington's chorea, 

CC amyotrophic lateral sclerosis, non-arterit ic optic neuropathy, and 

CC vitamin deficiency, intervertebral disc herniation, prion diseases such 

CC as Creutzfeldt -Jakob disease, carpal tunnel syndrome, peripheral 

CC neuropathies associated with various diseases, including but not limited 

CC to uremia, porphyria, hypoglycemia, Sjorgren Larsson syndrome, acute 

CC sensory neuropathy, chronic ataxic neuropathy, biliary cirrhosis, primary 

CC amyloidosis, obstructive lung diseases, acromegaly, malabsorption 

CC syndromes, polycythemia vera, immunoglobulin (Ig)A- and IgG gamma- 

CC pathies, complications of various drugs (e.g., metronidazole) and toxins 

CC (e.g., alcohol or organophosphates) , Charcot -Marie-Tooth disease, ataxia 

CC telangectasia, Friedreich's ataxia, amyloid polyneuropathies, 

CC adrenomyeloneuropathy, Giant axonal neuropathy, Refsum's disease, Fabry's 

CC disease, or 1 ipoproteinemia . The present sequence represents the human 

CC neurotransmitter receptor protein Nogo-A, an example of NS-specific 

CC antigen. 

XX 

SQ Sequence 1192 AA; 

Query Match 98.1%; Score 5810; DB 23; Length 1192; 

Best Local Similarity 97.2%; Pred. No. le-293; 

Matches 1159; Conservative 4; Mismatches 14; Indels 16; Gaps 3; 



Qy 



1 MEDLDQSPLVSSSDSPPRPQPAFKYQFVREPEDEEEEEEEEEEDEDEDLEELEVLERKPA 60 



i:i i i i:. i i i i ;,i i i i i i m mm I I 

EDLDQSPLVSSSDSPPRPQPAFKYQFVREPEDEEEEEEEEEEDEDEDLEELEVLERKPA 60 



Db 


1 


MEDLDQSPLVSSSDSPPRPQPAFKYQFVREPEDEEEEEEEEEEDEDEDLEELEVLERKPA 


60 


Qy 


61 


AGLSAAPVPTAPAAGAPLMDFGNDFVPPAPRGPLPAAPPVAPERQPSWDPSPVSSTVPAP 


120 


Db 


61 


M Mil 1 1 1 1 Ml Ml 1 1 1 : 1 1 1 1 MIM 1 

AGLSAAPVPTAPAAGAPLMDFGNDFVPPAPRGPLPAAPPVAPERQPSWDPSPVSSTVPAP 


120 


Qy 


121 


SPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPVWTPPAPAPAAPPSTPAAPKRRG 


18 0 


Db 


121 


1 1 Ml 1 M U MM 1 i 1 1 1 ! 1 1 1 MIMM 1 1 

SPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPVWTPPAPAPAAPPSTPAAPKRRG 


18 0 


Qy 


181 


SSGA WXXXXKIMDLKEQPGNTISAGQEDFPSVLLETAASXPSLSP 


226 


Db 


181 


Mh h : IIIIIIIIIIIIIIIIIIMMMIIII Mill 

SSGSVDETLFALPAASEPVIRSSAENMDLKEQPGNTISAGQEDFPSVLLETAASLPSLSP 


240 


Qy 


227 


LSAASFKEHEYLGNLSTVLPTEGTLQENVSEASKEVSEKAKTLLIDRDLTEFSELEYSEM 


286 


Db 


241 


IMIMIIIIIIIIIIIIMIIIMIIMIIIIIMIIIIIIIIIMIIMIIIIIIIII 

LSAASFKEHEYLGNLSTVLPTEGTLQENVSEASKEVSEKAKTLLIDRDLTEFSELEYSEM 


300 


Qy 


287 


GSSFSVSPKAESAVI VANPREEI I VKNKDEEEKLVSNNILHXQQELPTALTKLVKEDEW 
1 1 1 1 I 1 1 1 1 I 1 i 1 1 1 1 1 1 1 II 1 1 1 I II 1 1 1 1 I i i i i 


346 


Db 


301 


1 > 1 1 M 1 1 1 1 i i (I l l l 1 1 1 1 1 1 1 1 I 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 II II M M 

GSSFSVSPKAESAVI VANPREEI I VKNKDEEEKLVSNNILHNQQELPTALTKLVKEDEW 36 0 


Qy 


347 


SSEKAKDSFNEKRVAVEAPMREEYADFKPFERVWEVKDSKEDSDMLAAGGKIESNLESKV 
1 1 1 1 II 1 1 1 II 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 l l l l l I i i i i i i i i 


406 


Db 


361 


I I l 1 1 1 1 1 M 1 1 1 I 1 1 1 1 M f 1 I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

SSEKAKDSFNEKRVAVEAPMREEYADFKPFERVWEVKDSKEDSDMLAAGGKIESNLESKV 42 0 


Qy 


407 


DKKCFADSLEQTNHEKDSESSNDDTSFPSTPEGI KDRSGAYI TCAPFNPAATES IATNI F 


466 


Db 


421 


IIIIIIIIIMMIIMIMIIIIIIMIMIIIMI IMIIIIIIIIIIIMIIIIII 

DKKCFADSLEQTNHEKDSESSNDDTSFPSTPEGI KDRPGAYI TCAPFNPAATES I ATNIF 


480 


Qy 


467 


PLLEDPTSENXTDEKKI EEKKAQI VTEKNTSTKTSNPFFVAAQDSETDYVTTDNLTKVTE 

Ml IMIII IIIIIIIIIIIIIIMIIIIIIIIMI 1 1 1 1 U Ml 


526 


Db 


481 


PLLGDPTSENKTDEKKI EEKKAQI VTEKNTSTKTSNPFLVAAQDSETDYVTTDNLTKVTE 


540 


Qy 


527 


EWANMPEGLTPDLVQEACESELNEVTGTKIAYETKMDLVQTSEVMQESLYPAAQLCPSF 

Mllllllllll MINIM Ml IIMIIIMIMIIIMMIIIIIIIIIillMIIII 


586 


Db 


541 


EWANMPEGLTPDLVQEACESELNEVTGTKIAYETKMDLVQTSEVMQESLYPAAQLCPSF 


600 


Qy 


587 


EESEATPSPVLPDIVMEAPLNSAVPSAGASVIQPSSSPLEASSVNYESIKHEPENPPPYE 


646 


Db 


601 


MIIIIIIIIMIIIIIIIIIIIMIIIIIIIIIMIIIIIIIIIIIMMIIIIIIMI 

EESEATPSPVLPDIVMEAPLNSAVPSAGASVIQPSSSPLEASSVNYESIKHEPENPPPYE 


660 


Qy 


647 


EAMSVSL-KVSGIKEEIKEPENINAALQETEAPYISIACDLIKETKLSAEPAPDFSDYSE 


705 


Db 


661 


M 1 1 1 1 1 IIIMIIIIIIIIIIIIIIIIIIIMIIIMIIIIMIIIIIIIMIIIIII 

EAMSVSLKKVSGIKEEIKEPENINAALQETEAPYISIACDLIKETKLSAEPAPDFSDYSE 


720 


Qy 


706 


MAKVEQPVPDHSELVEDSSPDSEPVDLFSDDSIPDVPQKQDETVMLVKESLTETSFESMI 


765 


Db 


721 


IIIMMIIIIIMIMIIMIIIIMIIIIIIMMIIIIMMIIIIIIIIIIIIIM 

MAKVEQPVPDHSELVEDSSPDSEPVDLFSDDSIPDVPQKQDETVMLVKESLTETSFESMI 


780 


Qy 


766 


EYENKEKLSALPPEGGKPYLESFKLSLDNTKDTLLPDEVSTLSKKEKIPLQMEELSTAVY 
1 1 1 1 1 1 II 1 1 1 1 1 II 1 1 I l l l l I M i t i i i i i i i i i i i i . i i i i 


825 


Db 


781 


1 1 M 1 1 1 1 1 1 1 | | | U | | | U m 1 1 1 1 1 1 1 1 1 1 1 1 | | | | 1 1 1 1 1 1 1 1 1 1 1 1 | | | | 

EYENKEKLSALPPEGGKPYLESFKLSLDNTKDTLLPDEVSTLSKKEKI PLQMEELSTAVY 84 0 


Qy 


826 


SNDDLFI SKEAQI RETETFSDSSPI EI I DEFPTLI SSKTDSFSKLAREYTDLEVSHKSEI 


885 



M Ml Mi I MIM MIM ' M MMMIM I M, MIM I 



Db 


841 


SNDDLFISKEAQIRETETFSDSSPIEI IDEFPTLISSKTDSFSKLAREYTDLEVSHKSEI 


900 


Qy 


886 


ANAPDGAGSLPCTELPHDLSLKNIQPKVEEKISFSDDFSKNGSATSKVLLLPPDVSALGH 

1 1 1 1 1 1 II 1 1 1 1 1 1 1 I 1 1 1 1 1 1 I I I I I I I I I | | | | M 1 1 II 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 
ANAPDGAGSLPCTELPHDLSLKNIQPKVEEKISFSDDFSKNGSATSKVLLLPPDVSALA- 


945 


Db 


901 


959 


Qy 


.946 


TQAEIESIVKPKVLEKEAEKKLPSDTEKEDRS PSAI FSADLGKTSVVDLLYWRDI KKTGV 


1005 


Db 


960 


M 1 1 1 1 1 II 1 1 1 1 1 II II 1 1 1 | | | | | | | | | | | | | | | | | : | | | | M 1 1 1 1 1 1 1 1 1 1 1 1 1 
TQAE I ES I VKPKVLVKEAEKKLPSDTEKEDRS PSAI FSAELSKTS WDLLYWRDI KKTGV 


1019 


Qy 


1006 


VFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRAYLES 


1065 


Db 


1020 


1 1 1 1 1 i 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 II 1 1 1 1 1 II 1 1 1 1 1 1 1 1 

VFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRI YKGVIQAIQKSDEGHPFRAYLES 


1079 


Qy 


1066 


EVAI SEELVQKYSNSALGHVNCTI KELRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGLT 


1125 


Db 


1080 


IIIIIIIIIIIMIMIIIIIMIIIIIIIIIMIIIIIIIIIIMIMMIIIIIMM 

EVAI SEELVQKYSNSALGHVNCTI KELRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGLT 


1139 


Qy 


1126 


LLI LAL I SLFS VPVI YERHQAQI DHYLGLANKNVKDAMAKI QAKI PGLKRKAE 1178 




Db 


1140 


MIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIMIMIIIIMIII 

LL I LAL I SLFS VPVI YERHQAQ I DHYLGLANKNVKDAMAKI QAKI PGLKRKAE 1192 





RESULT 7 


ABG3 093 8 


ID 


ABG30938 standard; Protein; 1192 AA. 


XX 




AC 


ABG3 0 93 8; 


XX 




DT 


21-OCT-2002 (first entry) 


XX 




DE 


Human NogoA protein. 


XX 




KW 


Human; Nogo; BACE; acute neuronal injury; spinal injury; head injury; 


KW 


stroke; peripheral nerve damage; neoplastic disorder; glioblastoma; 


KW 


neuroblastoma; hyperprol if erative disorder; dysprol if erative disorder- 


KW 


cirrhosis; psoriasis; keloid formation; fibrocystic condition; cancer; 


KW 


tissue hypertrophy; central nervous system; axon regeneration; NogoA; 


KW 


Nogo-associated disease; metastasis. 


XX 




OS 


Homo sapiens. 


XX 




PN 


WO200257483-A2. 


XX 




PD 


25-JUL-2002 . 


XX 




PF 


18-JAN-2002; 2002WO-GB00228 . 


XX 




PR 


18-JAN-2001; 2001GB-0001312 . 


XX 




PA 


(GLAX ) GLAXO GROUP LTD. 


PA 


(SMIK ) SMITHKLINE BEECHAM PLC. 


XX 




PI 


Blackstock WP, Hale RS, Prinjha R, Rowley A; 


XX 


DR 


WPI; 2002-599722/64. 


DR 


N-PSDB; ABK90134. 



PT Identifying modulators of Nogo or BACE activity for treating acute 

PT neuronal injuries, neoplastic or dysprol iterative disorders, comprises 

PT providing and monitoring interaction between Nogo and BACE polypeptides 

PT 

XX 

PS Disclosure; Page 59-62; 68pp; English. 
XX 

CC The present invention relates to a new method of identifying modulators 

CC of Nogo function or BACE activity. The method involves providing Nogo and 

CC BACE polypeptides capable of binding with each other, monitoring the 

CC interaction between these polypeptides, and determining if the test agent 

CC is a modulator of Nogo or BACE activity. The method is useful in treating 

CC acute neuronal injuries, such as spinal or head injury, stroke, 

CC peripheral nerve damage, and in neoplastic (e.g. glioblastomas, 

CC neuroblastomas) , hyperprol iterative or dysprol iterative disorders (e.g. 

CC cirrhosis, psoriasis, keloid formation, fibrocystic conditions, tissue 

CC hypertrophy) of the central nervous system. The BACE polypeptide is 

CC useful in screening methods to identify agents that may act as modulators 

CC of BACE activity and in particular agents that may be useful in treating 

CC Nogo-associated diseases. The modulators of Nogo or BACE polypeptides, 

CC and the polynucleotide encoding the BACE polypeptide are useful in 

CC manufacturing a medicament for the treatment or prevention of disorders 

CC responsive to the modulation of Nogo activity, in alleviating the 

CC symptoms or improving the condition of a patient suffering from this 

CC disorder, in axon regeneration, or in preventing metastasis or spreading 

CC of a cancer. The polynucleotide may also be an essential component in 

CC assays, a probe, in recombinant protein synthesis, and in gene therapy 

CC techniques. The present amino acid sequence represents the human NogoA 

CC protein of the invention. 

XX 

SQ Sequence 1192 AA; 



Query Match 98.1%; Score 5810; DB 23; Length 1192; 

Best Local Similarity 97.2%; Pred. No. le-293; 

Matches 1159; Conservative 4; Mismatches 14; Indels 16; Gaps 3 



Qy 


1 


MEDLDQSPLVSSSDSPPRPQPAFKYQFVREPEDEEEEEEEEEEDEDEDLEELEVLERKPA 


60 


Db 


1 


1 1 1 1 1 1 1 1 1 1 1 r 1 1 1 1 r i r 1 1 1 1 r f 1 1 1 1 r ii f 1 1 1 1 f 1 1 f 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

MEDLDQSPLVSSSDSPPRPQPAFKYQFVREPEDEEEEEEEEEEDEDEDLEELEVLERKPA 


60 


Qy 


61 


AGLSAAPVPTAPAAGAPLMDFGNDFVPPAPRGPLPAAPPVAPERQPSWDPSPVSSTVPAP 


120 


Db 


61 


MINI IMIIIIMIIIIIIMIIIIII MINI IMIIIMIIIIIIIIIIIMIIII 

AGLSAAPVPTAPAAGAPLMDFGNDFVPPAPRGPLPAAPPVAPERQPSWDPSPVSSTVPAP 


120 


Qy 


121 


SPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPVWTPPAPAPAAPPSTPAAPKRRG 


180 


Db 


121 


IIIMMIMMIIIMMIIIIIMIIIIMMIIIIIIIIIIIMIIIIIIMIMII 

SPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPVWTPPAPAPAAPPSTPAAPKRRG 


180 


Qy 


181 


SSGA WXXXXKIMDLKEQPGNTISAGQEDFPSVLLETAASXPSLSP 


226 


Db 


181 


IN: h : IIIIIMIIIIMIIIIIIIIIIIIMI Mill 

SSGSVDETLFALPAASEPVIRSSAENMDLKEQPGNTISAGQEDFPSVLLETAASLPSLSP 


240 


Qy 


227 


LSAASFKEHEYLGNLSTVLPTEGTLQENVSEASKEVSEKAKTLLIDRDLTEFSELEYSEM 


286 


Db 


241 


MMMMIIIIMIIIIIIMIIIIIIIIIIIMIIMIMIIIIIMMIMIIIIII 

LSAASFKEHEYLGNLSTVLPTEGTLQENVSEASKEVSEKAKTLLIDRDLTEFSELEYSEM 


300 



Qy 


287 


GSSFSVSPKAESAVI VANPREEI IVKNKDEEEKLVSNNILHXQQELPTALTKLVKEDEVV 346 

IMIIMIMIIIIIIIIIIIIIMMIMIIIIIMIIII II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

GSSFSVSPKAESAVIVANPREEI I VKNKDEEEKLVSNNILHNQQELPTALTKLVKEDEW 360 


Db 


301 


Qy 


347 


SSEKAKDSFNEKRVAVEAPMREEYADFKPFERWEVKDSKEDSDMLAAGGKIESNLESKV 


406 


Db 


361 


1 II IIIIIIIIIIIIIIIIMMIIIIIIIIIII MINIMI Mill II MINI MM 

SSEKAKDSFNEKRVAVEAPMREEYADFKPFERVWEVKDSKEDSDMLAAGGKIESNLESKV 


420 


Qy 


407 


DKKCFADSLEQTNHEKDSESSNDDTSFPSTPEGIKDRSGAYITCAPFNPAATESIATNIF 


466 


Db 


421 


IIMIIIIIIIIIMIMIIIIIIIIIIIIIMIIII llllllllllllllllllllll 

DKKCFADSLEQTNHEKDSESSNDDTSFPSTPEGIKDRPGAYITCAPFNPAATESIATNIF 


480 


Qy 


467 


PLLEDPTSENXTDEKKIEEKKAQIVTEKNTSTKTSNPFFVAAQDSETDYVTTDNLTKVTE 

Ml MINI IIIIIIIIIIIIIIIIIMIMMIII MIIIMIIMIIIIIIIIII 


526 


Db 


481 


PLLGDPTSENKTDEKKIEEKKAQIVTEKNTSTKTSNPFLVAAQDSETDYVTTDNLTKVTE 


540 


Qy 


527 


EWANMPEGLTPDLVQEACESELNEVTGTKIAYETKMDLVQTSEVMQESLYPAAQLCPSF 

IIIMIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIMIIIMIIIIIIIIIIIIIIMI 


586 


Db 


541 


EWANMPEGLTPDLVQEACESELNEVTGTKIAYETKMDLVQTSEVMQESLYPAAQLCPSF 


600 


Qy 


587 


EESEATPSPVLPDIVMEAPLNSAVPSAGASVIQPSSSPLEASSVNYESIKHEPENPPPYE 


646 


Db 


601 


MIIIIMIIIIIIIIIIIMIIMMIIIIIIIIIIIIIIMIIIMIIIIIINIIM 

EESEATPSPVLPDIVMEAPLNSAVPSAGASVIQPSSSPLEASSVNYESIKHEPENPPPYE 


660 


Qy 


647 


EAMSVSL-KVSG I KEE I KEPENI NAALQETEAPYI SI ACDLI KETKLSAEPAPDFSDYSE 


705 


Db 


661 


MMMI IIIMIIMIIIIIIIIIIIIIIIIIMIMMIIIIIIIIIIIIIIIMII 

EAMSVSLKKVSGIKEEIKEPENINAALQETEAPYI SI ACDLI KETKLSAEPAPDFSDYSE 


720 


Qy 


706 


MAKVEQPVPDHSELVEDSSPDSEPVDLFSDDSIPDVPQKQDETVMLVKESLTETSFESMI 


765 


Db 


721 


IIIIIMIIIIIIIIIIIIIIIIIIMIIIIIIIIIMIIIIIIMMIIIIIMIIIII 

MAKVEQPVPDHSELVEDSSPDSEPVDLFSDDSIPDVPQKQDETVMLVKESLTETSFESMI 


780 


Qy 


766 


EYENKEKLSALPPEGGKPYLESFKLSLDNTKDTLLPDEVSTLSKKEKI PLQMEELSTAVY 


825 


Db. 


781 


IMIIIIIIIIMIIIIIIIMIIIMIIIlllllMIIIIIIIIIIIIMIIIIIIIM 

EYENKEKLSALPPEGGKPYLESFKLSLDNTKDTLLPDEVSTLSKKEKI PLQMEELSTAVY 


840 


Qy 


826 


SNDDLFI SKEAQI RETETFSDSSPI EI I DEFPTLISSKTDSFSKLAREYTDLEVSHKSEI 


885 


Db 


841 


IIIIIMMIIIIIIIIMIIIIMIIIMIIIMMMIMIIIIIIIIIIIIMIIII 

SNDDLFISKEAQIRETETFSDSSPIEI IDEFPTLISSKTDSFSKLAREYTDLEVSHKSEI 


900 


Qy 


886 


ANAPDGAGSLPCTELPHDLSLKNIQPKVEEKISFSDDFSKNGSATSKVLLLPPDVSALGH 


945 


Db 


901 


■ 1 1 f 1 1 1 1 1 f 1 1 1 f 1 1 1 1 1 f i r r J 1 1 1 J i r j 1 1 1 1 f 1 1 r 1 1 1 1 1 1 j i f 1 1 1 1 1 1 1 1 1 1 

ANAPDGAGSLPCTELPHDLSLKNIQPKVEEKISFSDDFSKNGSATSKVLLLPPDVSALA- 


959 


Qy 


946 


TQAEI ESI VKPKVLEKEAEKKLPSDTEKEDRSPSAI FSADLGKTSWDLLYWRDI KKTGV 
1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 II 1 1 II 1 1 1 1 1 1 1 ■ 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 1 1 1 1 


1005 


Db 


960 


111,1111111111 1 1 1 1 1 1 1 1 1 I 1 I 1 1 II 1 1 1 1 1 1 1 1 - 1 1 1 1 1 1 1 1 1 1 ] 1 I I 1 1 1 | 1 

TQAEIESIVKPKVLVKEAEKKLPSDTEKEDRSPSAI FSAELSKTSWDLLYWRDI KKTGV 


1019 


Qy 


1006 


VFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRI YKGVIQAIQKSDEGHPFRAYLES 


1065 


Db 


1020 


IMMMIIIIIIMIMIIIIMIIMIIIMIIIIMIIIMIIIIIIIMIIIIMI 

VFGASLFLLLSLTVFS I VS VTAY I ALALLS VTI SFR I YKGVI QA I QKSDEGHPFRAYLES 


1079 


Qy 


1066 


EVAI SEELVQKYSNSALGHVNCTI KELRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGLT 


1125 


Db 


1080 


IMIIMIIIIIIIMIMIIIMIIMMIIMIIIMIMIIIIIMIIIIIMIMI 

EVAI SEELVQKYSNSALGHVNCTI KELRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGLT 


1139 


Qy 


1126 


LLILALI SLFS VPVI YERHQAQ I DHYLGLANKNVKDAMAKI QAKI PGLKRKAE 117 8 





IIIIMIMIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIII 

Db 1140 LLILALI SLFSVPVI YERHQAQI DHYLGLANKNVKDAMAKI QAKI PGLKRKAE 1192 



RESULT 8 
AAU33228 

ID AAU33228 standard; Protein; 1246 AA. 
XX 

AC AAU3322 8; 
XX 

DT 18-DEC-2001 (first entry) 
XX 

DE Novel human secreted protein #3719. 
XX 

KW Human; vaccination; gene therapy; nutritional supplement ; 

KW stem cell proliferation; haematopoiesis ; nerve tissue regeneration; 

KW immune suppression; immune stimulation; ant i - inflammatory; leukaemia. 

XX 

OS Homo sapiens. 
XX 

PN WO200179449-A2 . 
XX 

PD 25-OCT-2001. 
XX 

PF 16-APR-2001; 2001WO-US08656 . 
XX 

PR 18-APR-2000; 2000US-0552929 . 

PR 26-JAN-2001; 2 0 0 1US- 0770160 . 
XX 

PA (HYSE-) HYSEQ INC . 
XX 

PI Tang YT, Liu C, Drmanac RT; 
XX 

DR WPI; 2001-611725/70. 
XX 

PT Nucleic acids encoding a range of human polypeptides, useful in genetic 

PT vaccination, testing and therapy - 

XX 

PS Claim 20; Page 737; 765pp; English. 
XX 

CC The invention relates to novel human secreted polypeptides. The 

CC polypeptides and antibodies to the polypeptides are useful for 

CC determining the presence of or predisposition to a disease associated 

CC with altered levels of polypeptide. The polypeptides are also useful for 

CC identifying agents (agonists and antagonists) that bind to them. Cells 

CC expressing the proteins are useful for identifying a therapeutic agent 

CC for use in treatment of a pathology related to aberrant expression or 

CC physiological interactions of the polypeptide. Vectors comprising 

CC the nucleic acids encoding the polypeptides and cells genetically 

CC engineered to express them are also useful for producing the proteins. 

CC The proteins are useful in genetic vaccination, testing and 

CC therapy, and can be used as nutritional supplements. They may be used to 

CC increase stem cell proliferation; to regulate haematopoiesis; and in 

CC bone, cartilage, tendon and/or nerve tissue growth or regeneration; 

CC immune suppression and/or stimulation; as anti- inflammatory agents; and 

CC in treatment of leukaemias. AAU2 9 5 1 0 - AAU3 3 3 04 represent the amino acid 

CC sequences of novel human secreted proteins of the invention. 



XX 

SQ Sequence 1246 AA; 



Query Match 93.3%; Score 5526.5; DB 22; Length 1246; 

Best Local Similarity 93.0%; Pred. No. 5.9e-279; 

Matches 1123; Conservative 13; Mismatches 39; Indels 33; Gaps 10; 
Qy 1 MEDLDQSPLVSSSDSPPRPQPAFKYQFVREPEDEEEEEEEEEEDEDEDLEELEVLERKPA 60 

I I ! I , MMM I I M I M Ml ! I I i I II I I 

Db 42 MEDLDQSPLVSSSDSPPRPQPAFKYQFWEPEDEEEEEEEEEEDEDEDLEELEVLERKPA 101 

Qy 61 AGLSAAPVPTAPAAGAPLMDFGNDFVPPAPRGPLPAAPPVAPERQPSWDPSPVSSTVPAP 120 

MINI III II I Mill IIMM Mill I Mill IIIMI IIIIIIIMI Illllll III 

Db 102 AGLSAAPVPTAPAAGAPLMDFGNDFVPPAPRGPLPAAPPVAPERQPSWDPSPVSSTVPAP 161 

Qy 121 SPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPVWTPPAPAPAAPPSTPAAPKRRG 18 0 

( I I I I M I I M I M I I Ml I |;| I | | | || | 

Db 162 SPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPVWTPPAPAPAAPPSTPAAPKRRG 221 
Qy 181 SSGA WXXXXKIMDLKEQPGNTISAGQEDFPSVLLETAASXPSLSP 226 

MM h : 1 1 MINI Mill III III Mill MM I MM 

Db 222 SSGSVDETLFALPAASEPVIRSSAENMDLKEQPGNTI SAGQEDFPSVLLETAASLPSLSP 281 

Qy 227 LSAASFKEHEYLGNLSTVLPTEGTLQENVSEASKEVSEKAKTLLIDRDLTEFSELEYSEM 286 

II II I II II II M I II 1 1 II II I II 1 1 II M I M I II I II M II 1 1 II II I II II I M 1 1 

Db 282 LSAASFKEHEYLGNLSTVLPTEGTLQENVSEASKEVSEKAKTLLIDRDLTEFSELEYSEM 341 

Qy 287 GSSFSVSPKAESAVIVANPREEIIVKNKDEEEKLVSNNILHXQQELPTALTKLVKEDEW 346 

M M I M II 1 1 M I II M I M 1 1 1 II 1 1 1 M 1 1 II II 1 1 M IIIIIIMIIIIIIIIII 

Db 342 GSSFS VS PKAESAVI VANPREE 1 1 VKNKDEEEKLVSNN I LHNQQELPTALTKLVKEDEW 4 01 

Qy 347 SSEKAKDSFNEKRVAVEAPMREEYADFKPFERVWEVKDSKEDSDMLAAGGKI ESNLESKV 4 06 

M M M II I M II 1 1 1 II II 1 1 1 II II 1 1 M I II 1 1 1 1 1 1 1 II M I M II I II 1 1 M 1 1 1 

Db 4 02 SSEKAKDSFNEKRVAVEAPMREEYADFKPFERVWEVKDSKEDSDMLAAGGKI ESNLESKV 461 

Qy 4 07 DKKCFADSLEQTNHEKDSESSNDDTSFPSTPEGI KDRSGAYI TCAPFNPAATES I ATNI F 4 66 

MM I Mill MM II I IIMM MM I MM I IIIMI Mill IIMM Mill Mill 

Db 4 62 DKKCFADSLEQTNHEKDSESSNDDTSFPSTPEGI KDRSGAYI TCAPFNPAATES I ATNI F 521 

Qy 4 67 PLLEDPTSENXTDEKKIEEKKAQIVTEKNTSTKTSNPFFVAAQDSETDYVTTDNLTKVTE 526 

Ml MUM IIIIMIIIIIIIIIMIIIIMIIM II I II 1 1 1 1 1 1 1 1 1 1 II I II I 

Db 522 PLLGDPTSENKTDEKKIEEKKAQIVTEKNTSTKTSNPFLVAAQDSETDYVTTDNLTKVTE 581 

Qy 527 EWANMPEGLTPDLVQEACESELNEVTGTKIAYETKMDLVQTSEVMQESLYPAAQLCPSF 586 

MMIIMIIIIIIMIIMIIIIIIIMIIIMIIIIIIIIIMIIIIIIIIIIIIIII 

Db 582 EWANMPEGLTPDLVQEACESELNEVTGTKIAYETKMDLVQTSEVMQESLYPAAQLCPSF 641 

Qy 587 EESEATPSPVLPDIVMEAPLNSAVPSAGASVIQPSSSPLEASSVNYESIKHEPENPPPYE 646 

MM MMMIIMI II III II MMMIMMIMM IIIMI Mill IIMM Mill 

Db 642 EESEATPSPVLPDIVMEAPLNSAVPSAGASVIQPSSSPLEASSVNYESIKHEPENPPPYE 701 

Qy 647 EAMSVSL-KVSGIKEEIKEPENINAALQETEAPYISIACDLIKETKLSAEPAPDFSDYSE 705 

MMMI MMMMMMMMMMMMMMMMMMMMMMMMMM 

Db 702 EAMSVSLKKVSGIKEEIKEPENINAALQETEAPYISIACDLIKETKLSAEPAPDFSDYSE 761 

Qy 706 MAKVEQPVPDHSELVEDSSPDSEPVDLFSDDSIPDVPQKQDETVMLVKESLTETSFESMI 765 

Ml MMM I I MM I I U MMM MMMM I I II I I 



Db 


762 


MAKVEQPVPDHSELVEDSSPDSEPVDLFSDDSI PDVPQKQDETVMLVKESLTETSFESMI 


821 


Qy 


766 


EYENKEKLSALPPEGGKPYLESFKLSLDNTKDTLLPDEVSTLSKKEKI PLQMEELSTAVY 


825 


Db 


822 


IMIM. Millll Mlllll llllill Mlllll llillll MIMM MM 

EYENKEKLSALPPEGGKPYLESFKLSLDNTKDTLLPDEVSTLSKKEKI PLQMEELSTAVY 


881 


Qy 


826 


SNDDLFISKEAQIRETETFSDSSPIEI IDEFPTLISSKTDSFSKLAREYTDLEVSHKSEI 

1 1 1 1 1 M 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 II M 1 1 1 1 II M M 1 1 1 
• 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 [ 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

SNDDLFISKEAQIRETETFSDSSPIEI I DEFPTLISSKTDSFSKLAREYTDLEVSHKSEI 


885 


Db 


882 


941 


Qy 


886 


ANAPDGAGSLPCTELPHDLSLKNIQPKVEEKISFSDDFSKNGSATSKVLLLPPDVSALGH 
1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

1 1 1 1 ' 1 1 1 1 1 1 ! 1 1 1 1 II 1 1 1 M 1 M 1 1 1 | | | | | | | M 1 M | | | | | | 1 | | | | | | | | | | 

ANAPDGAGSLPCTELPHDLSLKNIQPKVEEKISFSDDFSKNGSATSKVLLLPPDVSALA- 


945 


Db 


942 


1000 


Qy 


946 


TQAEI ES I VKPKVLEKEAEKKLPSDTEKEDRSPSAI FSADLGKTSWDLLYWRDI KKTGV 
1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 - 1 1 1 1 M 1 1 1 1 II 1 II 1 II 1 


1005 


Db 


1001 


1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 II 1 1 1 1 II I I I I I I I II 1 1 1 - 1 1 1 1 1 1 1 1 1 1 1 1 1 H 1 1 1 1 

TQAEIESI VKPKVLVKEAEKKLPSDTEKEDRSPSAIFSAELSKTSVVDLLYWRDIKKTGV 


1060 


Qy 


1006 


VFGAS -LFLLLSLTVFS I VSVTAYI ALALLS VTI SFRI YKGVI QAI QKSDEGHPFRAY- - 


1062 


Db 


1061 


| | | | | : | | | | | | | | | | | | | || | | | | | | | | | | | | | | | || | | | | | | | | | | | | | | | | | | 
VFGASAVFLLLSLTVFS I VS VTA Y I ALALLS VT I S FR I YKGVI QA I QKS DEGHPFRA I SG 


1120 


Qy 


1063 


- LESEVAI SEELVQKYSNSALGHV-NCTI KELRR LFLVDDLVDSLK- FAVLMWVFTY 

1 1 1 : : 1 ' 1 1 1 1 1 1 1 • 1 1 1 - 1 I 1 1 II 1! 1 - 1 1 1 I ! 1 1 1 1 1 


1116 


Db 


1121 


111 1 - I I I M I I • 1 1 1 • 1 i 1 IMIM' 1 II M M M I 
NLESCLYLRELGSGRYSNSALGSMWNCTVKGNFRAPSFFSWMDLVDSLRSFAVLMWVFTY 


1180 


Qy 


1117 


VGALFNGLTLL 1 LALI SLFS VP VI YERHQAQI DHYLGLANKNVKDAMAKI QAKI 

1 1 1 M M I || MM Mlllll llillll MMMMII 


1170 


Db 


1181 


VGCL- -GLMVLDTTGFWALNF I SSSGSWL I YERHQAQI DHYLGLANKNVKDAMAKI QAKI 


1238 


Qy 


1171 


PGLKRKAE 1178 




Db 


1239 


Illlllll 
PGLKRKAE 124 6 





RESULT 9 
ABU11573 

ID ABU11573 standard; Protein; 983 AA. 
XX 

AC ABU11573; 
XX 



DT 12-FEB-2003 (first entry) 
XX 

DE Human MDDT polypeptide SEQ ID 520. 
XX 

KW MDDT; human; disease detection and treatment molecule polypeptide; 

KW anti-inflammatory; immunosuppressive; osteopathic; cytostatic; anti-HIV; 

KW haemostatic; nephrotropic ; antianaemic; antipsoriatic; hepatotropic ; 

KW gene therapy; protein replacement therapy; cell proliferative disorder; 

KW cancer; adenocarcinoma; leukaemia; lymphoma; melanoma; myeloma; sarcoma; 

KW anaemia; Crohn's disease; acquired immunodeficiency syndrome; AIDS; 

KW Goodpasture's syndrome; inflammation; osteoporosis; thrombocytopaenia ; 

KW psoriasis; hepatitis. 

XX 

OS Homo sapiens. 
XX 

PN WO200279449-A2 . 



PD 


10 


-OCT- 


2002 . 






XX 












PF 


27 


-MAR- 


2002; 


2 002WO 


-US09944 


XX 












PR 


28 


-MAR- 


2001; 


2001US 


-279619P 


PR 


29 


-MAR- 


2001; 


2001US 


-280067P 


PR 


29 


-MAR- 


2001; 


2001US 


-280068P 


PR 


16 


-MAY- 


2001; 


2001US 


-291280P 


PR 


17 


-MAY- 


2001; 


2001US 


-291829P 


PR 


17 


-MAY- 


2001; 


2001US 


-291849P 


PR 


19 


-JUN- 


2001; 


2001US 


-299428P 


PR 


20 


-JUN- 


2001; 


2001US 


-299776P 


PR 


20 


-JUN- 


2001; 


2001US 


-300001P 



XX 

PA (INCY-) INCYTE GENOMICS INC. 
XX 

PI Daffo A, Jones AL, Tran AB, Dahl CR, Gietzen D, Chinn J; 

PI Dufour GE, Hillman JL, Yu JY, Tuason O, Yap PE, Amshey SR; 

PI Daugherty SC, Dam TC, Liu TF, Nguyen DA, Kleefeld Y, Gerstin EH; 

PI Peralta CM, David MH, Lewis SA, Chen AJ, Panzer SR, Harris B; 

PI Flores V, Marwaha R, Lo A, Lan RY, Urashka ME; 

XX 

DR WPI; 2003-058431/05. 

DR N-PSDB; ABX34563 . 
XX 

PT New purified disease detection and treatment molecule proteins and 

PT polynucleotides, useful for diagnosing, treating or preventing cancers 

PT (e.g. leukemia or sarcoma), anemia, Crohn's disease, AIDS, osteoporosis 

PT or hepatitis 

XX 

PS Claim 27; SEQ ID NO 520; 339pp + Sequence Listing; English. 
XX 

CC This invention describes a novel disease detection and treatment molecule 

CC polypeptide (MDDT) which has ant i -inflammatory, immunosuppressive, 

CC osteopathic, cytostatic, anti-HIV, haemostatic, nephrotropic, 

CC antianaemic, antipsoriatic and hepatotropic activity. The polynucleotides 

CC and the polypeptides of the invention can be used for gene therapy, 

CC protein replacement therapy and are useful for treating a variety of 

CC diseases or conditions. These polypeptides or polynucleotides are 

CC particularly useful for diagnosing, treating or preventing cell 

CC proliferative disorders (e.g. cancers including adenocarcinoma, 

CC leukaemia, lymphoma, melanoma, myeloma or sarcoma), anaemia, Crohn's 

CC disease, acquired immunodeficiency syndrome (AIDS), Goodpasture's 

CC syndromes, inflammation, osteoporosis, thrombocytopaenia , psoriasis or 

CC hepatitis. ABU11450-ABU11845 represent the MDDT polynucleotides encoded 

CC by ABU11450-ABU11845, described in the disclosure of the invention. 

CC NOTE: The sequence data for this patent did not form part of the printed 

CC specification, but was obtained in electronic format from WIPO at 

CC ftp.wipo. int/pub/published_pct_sequences . 

XX 

SQ Sequence 983 AA; 



Query Match 77.0%; Score 4560; DB 24 

Best Local Similarity 97.1%; Pred. No. 7.8e-229 
Matches 919; Conservative 8; Mismatches 11 



Length 983; 

Indels 8; Gaps 3 



Qy 


240 


NLSTVL PTEGTLQENVSEASKEVSEKAKTLLIDRDLTEFSELEYSEMGSSFSVS 

h : 1 : - 1 • • - 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 1 I 1 1 1 
■ 1 1 * *• 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 i 1 1 1 1 1 1 1 1 II M I 1 1 1 

NMNTLVICQQYYPLKEHFKKNVSEASKEVSEKAKTLLIDRDLTEFSELEYSEMGSSFSVS 


293 


Db 


39 


98 


Qy 


294 


PKAESAVIVANPREEIIVKNKDEEEKLVSNNILHXQQELPTALTKLVKEDEWSSEKAKD 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 II 
' l I I I I I 1 1 1 1 1 1 1 1 1 1 1 1 I I 1 I 1 1 I I I I 1 I | 1 | 1 1 1 1 1 1 1 1 M 1 | | | | | | | | | 1 1 1 1 1 

PKAESAVIVANPREEI I VKNKDEEEKLVSNNILHNQQELPTALTKLVKEDEWSSEKAKD 


353 


Db 


99 


158 


Qy 


354 


SFNEKRVAVEAPMREEYADFKPFERVWEVKDSKEDSDMLAAGGKIESNLESKVDKKCFAD 
1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 


413 


Db 


159 


f 1 * 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 I I 1 I I I I | | | | H 1 1 1 1 1 1 1 1 1 | 1 1 1 1 I 1 1 1 1 1 1 1 1 1 

SFNEKRVAVEAPMREEYADFKPFERVWEVKDSKEDSDMLAAGGKIESNLESKVDKKCFAD 


218 


Qy 


414 


SLEQTNHEKDSESSNDDTSFPSTPEGIKDRSGAYITCAPFNPAATESIATNIFPLLEDPT 
1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 III 


473 


Db 


219 


1 1 M 1 1 1 i i 1 1 1 1 1 1 1 1 1 1 1 1 |j | 1 1 M 1 1 1 1 M 1 1 1 M | | | | | M | U | | | | | | | Ell 
SLEQTNHEKDSESSNDDTSFPSTPEGIKDRSGAYITCAPFNPAATESIATNIFPLLGDPT 


278 


Qy 


474 


SENXTDEKKIEEKKAQI VTEKNTSTKTSNPFFVAAQDSETDYVTTDNLTKVTEEWANMP 
III 1 1 1 1 1 1 1 1 1 1 II II 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 II 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 


533 


Db 


279 


III 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 1 1 M 1 1 1 II 1 1 1 1 1 j 1 1 1 i 1 I 1 I I I | I | | | 
SENKTDEKKIEEKKAQIVTEKNTSTKTSNPFLVAAQDSETDYVTTDNLTKVTEEWANMP 


338 


Qy 


534 


EGLTPDLVQEACESELNEVTGTKIAYETKMDLVQTSEVMQESLYPAAQLCPSFEESEATP 
1 1 1 M 1 II II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 II 1 1 II 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 


593 


Db 


339 


1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 II M 1 1 1 1 I I I I I I I I | | | 1 | [ | | | | ( | | | | | | | | 
EGLTPDLVQEACESELNEVTGTKIAYETKMDLVQTSEVMQESLYPAAQLCPS FEES EAT P 


398 


Qy 


594 


SPVLPDIVMEAPLNSAVPSAGASVIQPSSSPLEASSVNYESIKHEPENPPPYEEAMSVSL 
1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 I 1 1 1 I 1 1 1 1 :' 1 1 1 ! 1 1 1 1 1 1 1 1 1 1 1 M 


653 


Db 


399 


I I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 I M I 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 M 1 1 1 II II 
SPVLPDIVMEAPLNSAVPSAGASVIQPSSSPLEASSVNYESIKHEPENPPPYEEAMSVSL 


458 


Qy 


654 


-KVSGIKEEIKEPENINAALQETEAPYISIACDLIKETKLSAEPAPDFSDYSEMAKVEQP 
1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 


712 


Db 


459 


l I I I 'I 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 i 1 1 1 1 1 1 f 1 1 M 1 1 1 1 1 1 1 1 1 1 1 II II 1 1 1 1 I I I I I 

KKVSGIKEEIKEPENINAALQETEAPYISIACDLIKETKLSAEPAPDFSDYSEMAKVEQP 


518 


Qy 


713 


VPDHSELVEDSSPDSEPVDLFSDDSI PDVPQKQDETVMLVKESLTETSFESMI EYENKEK 
1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 


772 


Db 


519 


1 1 1 1 1 ' 1 1 1 1 1 1 1 1 M 1 1 1 1 1 II 1 1 1 1 1 1 I I I I M 1 I 1 1 1 1 1 1 1 II 1 || I I 11 I I I I | | | 

VPDHSELVEDSSPDSEPVDLFSDDSI PDVPQKQDETVMLVKESLTETSFESMI EYENKEK 


578 


Qy 


773 


LSALPPEGGKPYLESFKLSLDNTKDTLLPDEVSTLSKKEKIPLQMEELSTAVYSNDDLFI 
1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 


832 


Db 


579 


' 1 1 1 1 I I 1 1 1 1 1 f 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 | | | | | || 1 | | | | | | | | | | | | 
LSALPPEGGKPYLESFKLSLDNTKDTLLPDEVSTLSKKEKIPLQMEELSTAVYSNDDLFI 


638 


Qy 


833 


SKEAQIRETETFSDSSPIEI IDEFPTLISSKTDSFSKLAREYTDLEVSHKSEIANAPDGA 
1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 


892 


Db 


639 


I I 1 1 1 ■ N 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 I I I I I I | | | 1 1 1 1 1 1 1 
SKEAQIRETETFSDSSPIEI I DEFPTLISSKTDSFSKLAREYTDLEVSHKSEIANAPDGA 


698 


Qy 


893 


GSLPCTELPHDLSLKNIQPKVEEKISFSDDFSKNGSATSKVLLLPPDVSALGHTQAEIES 
1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II M 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 ! 1 ! 1 1 1 1 1 ! 1 1 1 1 1 1 1 1 1 1 1 


952 


Db 


699 


1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 I 1 I 1 1 1 1 1 1 1 I 1 1 1 1 1 1 1 
GSLPCTELPHDLSLKNIQPKVEEKISFSDDFSKNGSATSKVLLLPPDVSALA-TQAEIES 


757 


Qy 


953 


I VKPKVLEKEAEKKLPSDTEKEDRSPSAI FSADLGKTSWDLLYWRDI KKTGWFGASLF 


1012 


Db 


758 


MMIII IIIUIMMMIM! IMM 1 1 1 1 1 1 1 1 1 1 II M 1 1 1 1 1 1 1 1 1 1 1 

I VKPKVLVKEAEKKL PS DTEKEDRS PSA I FSAELSKTSVVDLLYWRD I KKTGWFGASLF 


817 


Qy 


1013 


LLLSLTVFSI VSVTAYIALALLSVTISFRI YKGVIQAIQKSDEGHPFRAYLESEVAISEE 


1072 


Db 


818 


IIMIMMIMMIIIIIMIMIIIIIIIIIMIMIIIIMIIIIIIIIIIMIIII 

LLLSLTVFS I VS VTAY I ALALLS VT ISFRI YKGVI QAI QKSDEGHPFRAYLESEVAI SEE 


877 



Qy 1073 LVQKYSNSALGHVNCTI KELRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGLTLLI LALI 1132 



I I I I I, I I I I U !U I nu:u:i M i I l! 

Db 878 LVQKYSNSALGHVNCTI KELRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGLTLLI LALI 937 

Qy 1133 SLFSVPVI YERHQAQIDHYLGLANKNVKDAMAKIQAKIPGLKRKAE 1178 

I Mlllh Mill IIIIMMIIIIMIIIIinillllll 

Db 938 SLFSVPVI YERHQAQ I DHYLGLANKNVKDAMAKI QAKI PGLKRKAE 983 

RESULT 10 
AAY95012 

ID AAY95012 standard; Protein; 893 AA. 
XX 

AC AAY95012; 
XX 

DT 19-JUN-2000 {first entry) 
XX 

DE Human secreted protein vb22_l, SEQ ID NO: 64. 
XX 

KW Human; secreted protein; cancer; tumour; cardiovascular disorder; 

KW blood disorder; haemophilia; autoimmune disease; diabetes; inflammation; 

KW infection; fungal; bacterial; viral; HIV; allergy; arthritis,- 

KW neurodegenerative disease; asthma; contraceptive. 

XX 

OS Homo sapiens. 
XX 

PN WO200011015 X -Al. 
XX 

PD 02-MAR-2000 . 
XX 

PF 24-AUG-1999; 99WO-US19351 . 
XX 

PR 24-AUG-1998; 98US-0097638 . 

PR 24-AUG-1998; 98US- 0097659 . 

PR 09-SEP-1998; 98US-0099618 . 

PR 28-SEP-1998; 98US-0102092 . 

PR 25-NOV-1998; 98US-0109978 . 

PR 23-DEC-1998; 98US-0113645 . 

PR 23-DEC-1998; 98US-0113646 . 

PR 23-AUG-1999; 99US - 037924 6 . 
XX 

PA (ALPH-) ALPHAGENE INC. 
XX 

PI Valenzuela D, Yuan 0, Hoffman H # Hall J, Rapiejko P ; 
XX 

DR WPI; 2000-224657/19. 
XX 

PT New secreted or transmembrane proteins and polynucleotides encoding 

PT them, useful for treating neurodegenerative disorders, autoimmune 

PT diseases and cancer - 
XX 

PS Claim 73; Page 322-325; 357pp ; English. 
XX 

CC The invention relates to 40 human secreted proteins (AAY94981-Y95020) , 

CC and cDNA sequences encoding them (AAA23423 -A23462 ) . The secreted 

CC proteins of the invention include those that are thought to be only 

CC partially secreted, i.e., transmembrane proteins. The proteins of the 

CC invention may exhibit one or more activities selected from the following: 



CC cytokine activity; cell proliferation; differentiation; immune 

CC modulation; haematopoiesis regulation; tissue growth activity; 

CC activin/inhibin activity; chemotactic/chemokinet ic activity; haemostatic 

CC and thrombolytic activity; ant i- inflammatory activity; and tumour 

CC inhibition activity. The proteins may be administered to patients as 

CC vaccines, and the nucleotides may be used as part of a gene therapy 

CC regime. Diseases or conditions that may be treated using the proteins or 

CC nucleotides of the invention include autoimmune diseases; genetic 

CC disorders; haemophilia; cardiovascular diseases; cancer; bacterial, 

CC fungal and viral infections, especially HIV; multiple sclerosis; 

CC rheumatoid arthritis; pulmonary inflammation; Guillain-Barre syndrome; 

CC insulin dependent diabetes mellitus; and allergic reactions such as 

CC asthma and anaemia. They may also be used for treating wounds, burns, 

CC ulcers, osteoporosis, osteoarthritis, periodontal diseases, Alzheimer's 

CC disease, Parkinson's disease, Huntington's disease and amyotrophic 

CC lateral sclerosis (ALS) . Proteins with activin/inhibin activity may 

CC additionally be useful as contraceptives. Nucleic acid sequences of the 

CC invention may be used in chromosome mapping, and as a source of 

CC diagnostic primers and probes. The present sequence represents one of the 

CC 4 0 proteins of the invention. 

XX 

SQ Sequence 8 93 AA; 



Query Match 74.3%; Score 4400; DB 21; Length 893; 

Best Local Similarity 98.9%; Pred. No. 1.4e-220; 

Matches 884; Conservative 1; Mismatches 7; Indels 2; Gaps 2; 



Qy 


286 


MGSSFSVSPKAESAVI VANPREEI I VKN KD E E E KL VSNN I LHXQQE L PTALT KL VKED E V 345 
1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 I t 1 MM 1 M 1 1 M 1 


Db 


1 


II 1 M 1 1 1 II II M M II 1 II 1 1 M 1 1 1 1 1 1 1 M 1 1 1 1 1 1 M I II II M 

MGSSFSVSPKAESAVI VANPREEI I VKN KDEEEKLVSNNILHNQQELPTALTKLVKEDEV 


60 


Qy 


346 


VSSEKAKDSFNEKRVAVEAPMREEYADFKPFERVWEVKDSKEDSDMLAAGGKIESNLESK 


405 


Db 


61 


MIIIMIIIIIMIIIMIIIIIIIIIIMIIIMMIMIIMIMMIIIIMMII 

VSSEKAKDSFNEKRVAVEAPMREEYADFKPFERVWEVKDSKEDSDMLAAGGKIESNLESK 


120 


Qy 


406 


VDKKCFADSLEQTNHEKDSESSNDDTSFPSTPEGIKDRSGAYITCAPFNPAATESIATNI 


465 


Db 


121 


IIIIIIIMIIIIIIIIMIIIIIIIIIIIIIMIMMIIIIIMIIIIIIIIIMIM 

VDKKCFADSLEQTNHEKDSESSNDDTSFPSTPEGIKDRSGAYITCAPFNPAATESIATNI 


180 


Qy 


466 


FPLLEDPTSENXTDEKKIEEKKAQI VTEKNTSTKTSNPFFVAAQDSETDYVTTDNLTKVT 

MM MMM MIMMMMMMIMMMMMI MMMMMMMMMM 


525 


Db 


181 


FPLLGDPTSENKTDEKKIEEKKAQIVTEKNTSTKTSNPFLVAAQDSETDYVTTDNLTKVT 


240 


Qy 


526 


EEWANMPEGLTPDLVQEACESELNEVTGTKIAYETKMDLVQTSEVMQESLYPAAQLCPS 

M M M M M M M MM,! ;M M iM 

EEWANMPEGLTPDLVQEACESELNEVTGTKIAYETKMDLVQTSEVMQESLYPAAQLCPS 


585 


Db 


241 


300 


Qy 


586 


FEESEATPSPVLPDIVMEAPLNSAVPSAGASVIQPSSSPLEASSVNYESIKHEPENPPPY 


645 


Db 


301 


MMMMMMMMMMMMMMMMMMMIMMMMMMMMMMI 

FEESEATPSPVLPDIVMEAPLNSAVPSAGASVIQPSSSPLEASSVNYESIKHEPENPPPY 


360 


Qy 


646 


EEAMSVSL-KVSGIKEEIKEPENINAALQETEAPYISIACDLIKETKLSAEPAPDFSDYS 


704 


Db 


361 


MMMM MMMMMMMMMMMMMMMMMMMMMMMMMI 

EEAMS VSLKKVSGI KEEI KEPENI NAALQETEAPYI S I ACDLI KETKLSAEPAPDFSDYS 


420 


Qy 


705 


EMAKVEQPVPDHSELVEDSSPDSEPVDLFSDDSI PDVPQKQDETVMLVKESLTETSFESM 


764 



IIIIIIIIIIIIIMIIIMIIIIIIIIIIMIMIIMIIIIIIIIIIIIMIIIMI 



Db 


421 


EMAKVEQPVPDHSELVEDSSPDSEPVDLFSDDSI PDVPQKQDETVMLVKESLTETSFESM 


480 


Qy 


765 


I EYENKEKLSALPPEGGKPYLESFKXiSLDNTKDTLLPDEVSTLSKKEKI PLQMEELSTAV 824 


Db 


481 


IIMIIIIIIIIIIIIMIIMMIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIII 

I EYENKEKLSALPPEGGKPYLESFKLSLDNTKDTLLPDEVSTLSKKEKI PLQMEELSTAV 54 0 


Qy 


825 


YSNDDLFISKEAQIRETETFSDSSPIEI IDEFPTLISSKTDSFSKLAREYTDLEVSHKSE 


884 


Db 


541 


IMIIIIIIIIIMIMMIIIMIIIIIIIIIMIIMIMIIIIIMIIIIIIIIIII 

YSNDDLFISKEAQIRETETFSDSSPIEI IDEFPTLISSKTDSFSKLAREYTDLEVSHKSE 


600 


Qy 


885 


IANAPDGAGSLPCTELPHDLSLKNIQPKVEEKISFSDDFSKNGSATSKVLLLPPDVSALG 

IIIIMIIIIIIIIIIIIMIIIIIIIIIIIIMIIIIIIMIIIMIIMIIIIMII 

IANAPDGAGSLPCTELPHDLSLKNIQPKVEEKISFSDDFSKNGSATSKVLLLPPDVSALA 


944 


Db 


601 


660 


Qy 


945 


HTQAEI ES I VKPKVLEKEAEKKLPSDTEKEDRSPSAI FSADLGKTSWDLLYWRDI KKTG 


1004 


Db 


661 


1 1 1 1 M 1 1 1 1 1 1 1 1 IlilMlllllllllllllllllhl 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 

-TQAEIESI VKPKVLVKEAEKKLPSDTEKEDRSPSAIFSAELSKTSVVDLLYWRDIKKTG 


719 


Qy 


1005 


VVFGASLFLLLSLTVFS I VSVTAYIALALLSVTI SFRIYKGVIQAIQKSDEGHPFRAYLE 


1064 


Db 


720 


1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 

VVFGASLFLLLSLTVFS I VSVTAYIALALLSVTI SFRIYKGVIQAIQKSDEGHPFRAYLE 


779 


Qy 


1065 


SEVAI SEELVQKYSNSALGHVNCTI KELRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGL 


1124 


Db 


780 


MIIIIIIMMIIIIIIIIIIIIIIIIIMIIIIIIIIMIIIIMIIMIIIMIIII 

SEVAI SEELVQKYSNSALGHVNCTI KELRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGL 


839 


Qy 


1125 


TLL I LAL I SLFSVPVI YERHQAQI DHYLGLANKNVKDAMAKI QAKI PGLKRKAE 1178 




Db 


840 


MIIIIMMIIIIIIIIIIIIIIIIIIIMIIIIIMIIIIIIIIIMIIMI 

TLL I LAL I SLFSVPVI YERHQAQ I DHYLGLANKNVKDAMAKI QAKI PGLKRKAE 8 93 





RESULT 11 
AAY71310 



ID AAY71310 standard; Protein; 1163 AA. 
XX 

AC AAY71310; 
XX 

DT 02-NOV-2000 (first entry) 
XX 

DE Rat neurite growth inhibitor Nogo A. 
XX 

KW Rat; neurite growth inhibitor; Nogo A; neural cell; myelin; CNS; 

KW central nervous system; neoplastic disease; antiproliferative; glioma; 

KW antisense gene therapy; neuroblastoma; menagioma; ret inoblastoma ; 

KW degenerative nerve disease; Alzheimer's disease; Parkinson's disease; 

KW hyperprol iterative disorder; benign dysprolif erat ive disorder; diagnosis; 

KW psoriasis; tissue hypertrophy; neuronal regeneration; treatment; 

KW structural plasticity; screening. 

XX 

OS Rattus sp. 
XX 

FH Key Location/Qualifiers 

FT Inhibitory-site 1..171 

FT /note= "Inhibits NIH 3T3 fibroblast spreading" 

FT Modif ied-site 30 

FT /note= "Casein kinase II site" 

FT Region 31 . .58 



FT 






/note= 


"Acidic region" 




FT 


Region 




31 . . 57 






FT 






/note= 


"Region specifically described 


FT 


Modified 


-site 


233 






FT 






/note= 


"Protein kinase C (PKC) 


site" 


FT 


Modified 


-site 


242. .244 




FT 






/note= 


"Asn is N-glycosylated" 




FT 


Modified 


-site 


291 






FT 






/note- 


"Protein kinase C (PKC) 


site" 


FT 


Modified 


-site 


295 






FT 






/note= 


"Protein kinase C (PKC) 


site" 


FT 


Misc-dif f erence 


404 






FT 






/note- 


"Encoded by TTG" 




FT 


Modif ied- 


-site 


436 






FT 






/note= 


"Protein kinase C (PKC) 


site" 


FT 


Modified- 


-site 


468 . .470 




FT 






/note= 


"Asn is N-glycosylated" 




FT 


Modif ied- 


-site 


484 






FT 






/note= 


"Protein kinase C (PKC) 


site" 


FT 


Modif ied- 


-site 


488 






FT 






/note= 


"Protein kinase C (PKC) 


site" 


FT 


Modif ied- 


-site 


502 






FT 






/note- 


"Casein kinase II site" 




FT 


Modif ied- 


-site 


576 






FT 






/note= 


"Casein kinase II site" 




FT 


Modif ied- 


-site 


626 






FT 






/note= 


"Protein kinase C (PKC) 


site" 


FT 


Modif led- 


-site 


694 . . 696 




FT 






/note= 


"Asn is N-glycosylated" 




FT 


Modif ied- 


-site 


715 






FT 






/note= 


"Casein kinase II site" 




FT 


Modified- 


-site 


784 






FT 






/note= 


"Protein kinase C (PKC) 


site" 


FT 


Modified- 


-site 


821 






FT 






/note- 


"Protein kinase C (PKC) 


site" 


FT 


Modified- 


■site 


850 






FT 






/note= 


"Protein kinase C (PKC) 


site" 


FT 


Modir lea- 


site 


855 






FT 






/note= 


"Protein kinase C (PKC) 


site" 


FT 


Modi fied- 


site 


863 






FT 






/note- 


"Casein kinase II site" 




FT 


Modif ied- 


■site 


868 






FT 






/note- 


"Protein kinase C (PKC) 


site" 


FT 


Modif ied- 


site 


893 






FT 






/note= 


"Protein kinase C (PKC) 


site" 


FT 


Modif ied- 


site 


912 . . 914 




FT 






/note= 


"Asn is N-glycosylated" 




FT 


Modif ied- 


site 


925 . . 927 




FT 






/note- 


"Asn is N-glycosylated" 




FT 


Modif ied- 


site 


954 






FT 






/note= 


"PKC and casein kinase II sites 


FT 


Modif ied- 


site 


956 






FT 






/note= 


"PKC and casein kinase II sites 


FT 


Domain 




988 . . 1023 




FT 






/label = 


Transmembrane domain 




FT 






/note= 


"C- terminal hydrophobic 


region 


FT 






specifically described in claim 


16" 



FT 


Modif ied-site 


1024 




FT 




/note= "Protein kinase C (PKC) site" 




FT 


Modified- site 


1071 . . 1073 




FT 




/note= "Asn is N-glycosylated" 




FT 


Modif ied-site 


1073 




FT 




/note= "Protein kinase C (PKC) site" 




FT 


Modif ied-site 


1089 




FT 




/note= "Protein kinase C (PKC) site" 




FT 


Domain 


1090. .1125 




FT 




/label = Transmembrane domain 




FT 




/note= "C- terminal hydrophobic region 




FT 




specifically described in claim 16" 




FT 


Modif ied-site 


1141 . . 1143 




FT 




/note= "Asn is N-glycosylated" 




FT 


Modif ied-site 


1143 




FT 




/note= "Protein kinase C (PKC) site" 




FT 


Peptide 


623. .640 




FT 




/note= "used as immunogen to generate antibody AS 


472" 


FT 


Peptide 


762 . . 1163 




FT 




/note= "used as immunogen to generate antibody AS 


Bruna 


FT 


Inhibitory-site 


542.. 722 




FT 


Region 


172 . .259 




FT 




/note- "This region is not essential for inhibitory 


FT 




activity" 




FT 


Region 


975. .1162 




FT 




/note= "This region is not essential for inhibito: 


ry 


FT 




activity" 


FT 


Region 


976. .1163 




FT 




/note= "C- terminal common region found in Nogo A, 


B and 


FT 




C isoforms" 




XX 








PN 


WO200031235-A2. 






XX 








PD 


02-JUN-2000 . 






XX 








PF 


05-NOV-1999; 99WO-US26160 . 




XX 








PR 


06 -NOV- 1998 ; 98US- 0 10744 6 . 




XX 








PA 


(SCHW/) SCHWAB M 


E. 




PA 


(CHEN/) CHEN M S 






V V 

AA 








PI 


Schwab ME, Chen 


MS; 




XX 








DR 


WPI; 2000-400052/34. 




DR 


N-PSDB; AAD01173 






XX 








PT 


Nogo proteins and nucleic acids useful for treating neoplastic 




PT 


disorders of the 


central nervous system and inducing regeneration 


of 


DT 


neurons - 






XX 








PS 


Claim 3; Fig 2A; 


122pp; English. 




XX 








cc 


The present sequence is a rat Nogo A protein which is a 




cc 


potent neural cell growth inhibitor and is free of all central nervous 


cc 


system (CNS) myelin material with which it is natively associated. 


The 


cc 


protein was derived from a cDNA generated by fusing R018U37-3, Rl- 


■3U21 



CC cDNAs isolated from hexanucleot ides -primed rat brain stem/spinal cord 

CC library, and 01il8 cDNA from an oligo d(T) -primed rat oligodendrocyte 

CC library. Nogo proteins and fragments displaying neurite growth 

CC inhibitory activity are used in the treatment of neoplastic disease of 

CC the CNS e.g. glioma, glioblastoma, medullobla stoma , craniopharyngioma, 

CC ependyoma, pinealoma, haemangioblastoma , acoustic neuroma, 

CC oligodendroglioma, menagioma, neuroblastoma or retinoblastoma and 

CC degenerative nerve diseases e.g. Alzheimer's and Parkinson's diseases. 

CC Therapeutics which promote Nogo activity can be used to treat or prevent 

CC hyperprol iterative or benign dysprol iterative disorders e.g. psoriasis 

CC and tissue hypertrophy. Ribozymes or antisense Nogo nucleic acids can be 

CC used to inhibit production of Nogo protein to induce regeneration of 

CC neurons or to promote structural plasticity of the CNS in disorders where 

CC neurite growth, regeneration or maintenance are deficient or desired. 

CC The animal models can be used in diagnostic and screening methods for 

CC predisposition to disorders and to screen for or test molecules which 

CC can treat or prevent disorders or diseases of the CNS. 

CC Note: The present sequence designated as SEQ ID NO: 2 is stated to 

CC be the same as the sequence shown in Fig. 13 (see AAY71384) of the 

CC specification. However, this sequence does not match the sequence given 

CC in Fig. 13. SEQ ID numbers 35-42 are referred in claim 32 and 

CC SEQ ID NO: 29 in disclosure of the specification. However, the 

CC specification does not include sequences for these SEQ ID numbers. 

XX 

SQ Sequence 1163 AA; 

Query Match 72.5%; Score 4296.5; DB 21; Length 1163; 

Best Local Similarity 74.0%; Pred. No. 4.8e-215; 

Matches 885; Conservative 104; Mismatches 156; Indels 51; Gaps 19; 

Qy 1 MEDLDQSPLVSSS-DSPPRPQPAFKYQFVREPEDEE-EEEEEEEEDEDEDLEELEVLERK 58 

Mhlll Mill I I I I I I IIIIIIII MINI llllhl|::||IMIIIIIIII 
Db 1 MEDIDQSSLVSSSTDSPPRPPPAFKYQFVTEPEDEEDEEEEEDEEEDDEDLEELEVLERK 60 

QY 59 PAAGLSAAPVPTAPAAGAPLMDFGNDFVPPAPRGPLPAAPPVAPERQPSWDPSPVSSTVP 118 

MIMIII M Ml I : I I HI IIIIIIII: II : I 

D k 61 PAAGLSAAAVP - - PAAAAPLLDFSSDSVPPAPRGPLPAAPPAAPERQPSWERSPAA P 115 

Qy H9 APSPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPVWTPPAPAPAAPPSTPAAPKR 178 

„ III 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II II IIIIIIMMIII 

Db 116 APSLPPAAAVLPSKLPEDDEPPARPPPPPPAGASPLAE PAAPPSTPAAPKR 166 

Qy 179 RGSSG AWXXXXKI MDLKEQPGNTI SAGQEDFPS VLLETAASXPSL 224 

Ml : Mill Ml llh hi MM I III II I II I IN 

Db 167 RGSGSVDETLFALPAASEPVIPSSAEKIMDLMEQPGNTVSSGQEDFPSVLLETAASLPSL 226 

Qy 225 SPLSAASFKEHEYLGNLSTVLPTEGTLQENVSEASKEVSEKAKTLLIDRDLTEFSELEYS 284 

MM Mill II I I M I :|||::| ::|||||: h| ::||l I I f I I I I I 

Db 227 SPLSTVSFKEHGYLGNLSAVSSSEGTIEETLNEASKELPERATNPFVNRDLAEFSELEYS 286 

Qy 285 EMGSSFSVSPKAESAVI VANPREEI IVKNKDEEEKLVSNNILHXQQELPTALTKLVKEDE 344 

IMMI III |||::| | :|h||::||:|: || = || || | ||| 
Db 287 EMGSSFKGSPKGESAILVENTKEEVIVRSKDKED-LVCSAALHSPQESPVG KEDR 34 0 

Qy 345 WSSEKAKDSFNEKRVAVEAPMREEYADFKPFERVWEVKDSKEDS-DMLAAGGKIESNLE 403 

Ml II I III ::M IhMIMIIMIh Mllh I I hill MM 
Db 341 WSPEKTMDIFNEMQMSWAPVREEYADFKPFEQAWEVKDTYEGSRDVLAA RANVE 396 



Qy 


404 


SKVDKKCFADSLEQTNHEKDSESSNDDTSFPSTPEGI KDRSGAYI TCAPFNPAATES IAT 


463 


Db 


397 


MINI Mill : MM hi MMIM Ml 1 llllll 1 MMI 

SKVDRKCIEDSLEQKSLGKDSEGRNEDASFPSTPEPVKDSSRAYITCASFT-SATESTTA 


455 


Qy 


464 


NI FPLLEDPTSENXTDEKKI EEKKAQI VTEKNTSTKTSNPFFVAAQDSETDYVTTDNLTK 523 
1 llllll MM 1 1 1 1 1 1 M M 1 1 1 M II II 11 1 M 1 II 1 1 1 1 llllll i.i 


Db 


456 


i ' ' i 1 I • I I I I • i I 1 || llllll || MM I M 1 1 1 1 ■ 1 

NTFPLLEDHTSENKTDEKKIEERKAQI ITEK-TSPKTSNPFLVAVQDSEADYVTTDTLSK 


514 


Qy 


524 


VTEEWANMPEGLTPDLVQEACESELNEVTGTKIAYETKMDLVQTSEVMQESLYPAAQLC 583 

III MMi 1 1 1 1 1 1 IIIIIIMIhllMIM I'M Ml 

VTEAAVSNMPEGLTPDLVQEACESELNEATGTKIAYETKVDLVQTSEAIQESLYPTAQLC 574 


Db 


515 


Qy 


584 


PSFEESEATPSPVLPDIVMEAPLNSAVPSAGASVIQPSSSPLEA-SSWYESIKHEPENP 


642 


Db 


575 


1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 MMMIIMM MMI IMMM Mill 

PSFEEAEATPSPVLPDI VMEAPLNSLLPSAGASVVQPSVSPLEAPPPVSYDSIKLEPENP 


634 


Qy 


643 


PPYEEAMSVSLKVSGIKEEIKEPENINAALQETEAPYISIACDLIKETKLSAEPAPDFSD 


702 


Db 


635 


MM hM I || MMM II 1 M 1 II II 1 II 1 1 1 1 II 1 1 1 1 M Ihlllh 

PPYEEAMNVALKALGTKEGIKEPESFNAAVQETEAPYI SI ACDLI KETKLSTEPSPDFSN 


694 


Qy 


703 


YSEMAKVEQPVPDHSELVEDSSPDSEPVDLFSDDSIPDVPQKQDETVMLVKESLTETSFE 


762 


Db 


695 


MM h MMMIIIIIIIMIMMIIIMIIMII Ml MMIM 1 1 

YSEIAKFEKSVPEHAELVEDSSPESEPVDLFSDDSIPEVPQTQEEAVMLMKESLTEVS-E 


753 


Qy 


763 


SMI EYENKEKLSALPPEGGKPYLESFKLSLDNTKDTLLPDEVSTLSKKEKI PLQMEELST 


822 


Db 


754 


- = = = MMM 1 1 IMMMM M MM MMM Mill M 

TVAQHK-EERLSASPQELGKPYLESFQPNLHSTKDA-ASNDIPTLTKKEKISLQMEEFNT 


811 


Qy 


823 


AVYSNDDLFISKEAQIRETETFSDSSPIEI IDEFPTLISSKTDSFSKLAREYTDLEVSHK 


882 


Db 


812 


MMI Ml MMMMIMMIIIIMIM MM II MMMIIMM 1 

AI YSNDDLLSSKEDKIKESETFSDSSPIEI IDEFPTFVSAKDDS-PKLAKEYTDLEVSDK 


870 


Qy 


883 


SEIANAPDGAGSLPCTELPHDLSLKNIQPKVEEKISFSDDFSKNGSATSKVLLLPPDVSA 


942 


Db 


871 


HIM II MM III III III M ::: IMMM h II MMM 

SEIANIQSGADSLPCLELPCDLSFKNI YPK- -DEVHVSDEFSENRSSVSKASI SPSNVSA 


928 


Qy 


943 


LGHTQAEIESIVKPKVLEKEAEKKLPSDTEKEDRSPSAIFSADLGKTSWDLLYWRDIKK 

1 1 h MM 1 1 IIIIIIIIIIIIIIMI IM MM ' 

L-EPQTEMGSIVKSKSLTKEAEKKLPSDTEKEDRSLSAVLSAELSKTSWDLLYWRDIKK 


1002 


Db 


929 


987 


Qy 


1003 


TGVVFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRI YKGVIQAIQKSDEGHPFRAY 


1062 


UQ 


o o o 
9 88 


M 1 M 1 1 M 1 M 1 M II II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1| 1 1 M M 1 1 1 M 1 1 1 1 II 1 1 1 

TGWFGASLFLLLSLTVFS I VS VTAY I ALALLS VT I S FR I YKGVI QAI QKSDEGHPFRAY 


1047 


Qy 


1063 


LESEVAI SEELVQKYSNSALGHVNCTI KELRRLFLVDDLVDSLKFAVLMWVFTYVGALFN 


1122 


Db 


1048 


i ' IIMIIIIMIIMIMIIMIIIIIMIMIIM 

LESEVAI SEELVQKYSNSALGHVNSTI KELRRLFLVDDLVDSLKFAVLMWVFTYVGALFN 


1107 


Qy 


1123 


GLTLL I LALI SLFS VPVI YERHQAQI DHYLGLANKNVKDAMAKI QAKI PGLKRKAE 1178 

M II 1 1 1 1 1 II M 1 M II 1 1 1 II IIMIIIIMIMIIIIIIMIIIMMMM 




Db 


1108 


GLTLL I LALI SLFS I P VI YERHQVQ I DH YLGLANKS VKDAMAKI QAKI PGLKRKAD 1163 





RESULT 12 
ABB81074 

ID ABB81074 standard; Protein; 1163 AA. 



XX 

AC ABB81074; 
XX 

DT 05-NOV-2002 (first entry) 
XX 

DE Rat neurotransmitter receptor protein Nogo-A. 
XX 

KW Nerve regeneration; neuroprotection; neuronal degeneration; CNS; PNS; 

KW central nervous system; peripheral nervous system; tranquillizer; Nogo; 

KW vulnerary; cerebroprotective; anti-tumour; antidiabetic; anticonvulsant; 

KW nootropic; antiparkinsonian; ophthalmological ; analgesic; hepatotropic ; 

KW osteopathic; vasotropic; nephrotropic; cytostatic; antigen; gene therapy; 

KW neurotransmitter receptor; rat; receptor. 
XX 

OS Rattus norvegicus . 
XX 

PN US2002072493-A1 . 
XX 

PD 13-JUN-2002 . 
XX 

PF 28-JUN-2001; 2 001US- 08 9334 8 . 
XX 

PR 19-MAY-1998; 98 IL- 0 1245 00 . 

PR 21-JUL-1998; 98WO-US14715 . 

PR 22-DEC-1998; 98US- 02 18277 . 

PR 19 -MAY- 19 99; 99US- 03 14 16 1 . 
XX 

PA (YEDA ) YEDA RES & DEV CO LTD. 
XX 

PI Eisenbach-Schwartz M, Hauben E, Cohen IR, Beserman P, Mosonego A; 

PI Moalem G; 

XX 

DR WPI; 2002-607255/65. 

DR N-PSDB; ABN86600- 
XX 

PT Promoting nerve regeneration and preventing neuronal degeneration in 

PT the central /peripheral nervous system from injury/disease, comprises 

PT administering nervous system-specific activated T cells/antigen, or 

PT analogs/peptides 
XX 

PS Example 5; Page 4 4-47; 93pp; English. 
XX 

CC The invention relates to promoting nerve regeneration or conferring 

CC neuroprotection and preventing or inhibiting neuronal degeneration in the 

CC central/peripheral nervous system (NS) . The method involves administering 

CC NS-specific activated T cells, NS-specific antigen, its analogue or its 

CC peptide, a nucleotide sequence the NS-specific antigen or its analogue or 

CC combinations. The method is useful for promoting nerve regeneration and 

CC preventing neuronal degeneration in central /peripheral nervous system 

CC from injury/disease, where the injury is spinal cord injury, blunt 

CC trauma, penetrating trauma, hemorrhagic stroke, ischaemic stroke or 

CC damages caused by surgery such as tumour excision. The disease is not an 

CC autoimmune disease or neoplasm. The disease results in a degenerative 

CC process occurring in either gray or white matter or both. The disease 

CC is diabetic neuropathy, senile dementia, Alzheimer's disease, Parkinson's 

CC disease, facial nerve (Bell's) palsy, glaucoma, Huntington's chorea, 

CC amyotrophic lateral sclerosis, non-arteritic optic neuropathy, and 



CC vitamin deficiency, intervertebral disc herniation, prion diseases such 

CC as Creutzfeldt- Jakob disease, carpal tunnel syndrome, peripheral 

CC neuropathies associated with various diseases, including but not limited 

CC to uremia, porphyria, hypoglycemia, Sjorgren Larsson syndrome, acute 

CC sensory neuropathy, chronic ataxic neuropathy, biliary cirrhosis, primary 

CC amyloidosis, obstructive lung diseases, acromegaly, malabsorption 

CC syndromes, polycythemia vera, immunoglobulin (Ig)A- and IgG gamma- 

CC pathies, complications of various drugs (e.g., metronidazole) and toxins 

CC (e.g., alcohol or organophosphates) , Charcot -Marie-Tooth disease, ataxia 

CC telangectasia, Friedreich's ataxia, amyloid polyneuropathies, 

CC adrenomyeloneuropathy, Giant axonal neuropathy, Ref sum's disease, Fabry's 

CC disease, or lipoproteinemia . The present sequence represents the rat 

CC neurotransmitter receptor protein Nogo-A, an example of NS-specific 

CC antigen. 

XX 

SQ Sequence 1163 AA; 



Query Match 72.5%; Score 4296.5; DB 23; Length 1163; 

Best Local Similarity 74.0%; Pred. No. 4.8e-215; 

Matches 885; Conservative 104; Mismatches 156; Indels 51; Gaps 19; 



Qy 1 MEDLDQSPLVSSS-DSPPRPQPAFKYQFVREPEDEE-EEEEEEEEDEDEDLEELEVLERK 58 

M I H 1 1 Mill llllll IIIIIIII llllll MMMMMMIMMI II 

Db 1 MEDIDQSSLVSSSTDSPPRPPPAFKYQFVTEPEDEEDEEEEEDEEEDDEDLEELEVLERK 60 

Qy 59 PAAGLSAAPVPTAPAAGAPLMDFGNDFVPPAPRGPLPAAPPVAPERQPSWDPSPVSSTVP 118 

IIIIIIII II III Mhll M llllllllllllll lllllllh II : I 
Db 61 PAAGLSAAAVP- - PAAAAPLLDFSSDSVPPAPRGPLPAAPPAAPERQPSWERSPAA P 115 

Qy 119 APSPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPVWTPPAPAPAAPPSTPAAPKR 178 

III 1 1 I I I I I I . IN I I I I IN II II Mllllll ; 

Db 116 APSLPPAAAVLPSKLPEDDEPPARPPPPPPAGASPLAE PAAPPSTPAAPKR 166 

Qy 179 RGSSG AWXXXXKIMDLKEQPGNTISAGQEDFPSVLLETAASXPSL 224 

Ml : MM! : Ml 

Db 167 RGSGSVDETLFALPAASEPVIPSSAEKIMDLMEQPGNTVSSGQEDFPSVLLETAASLPSL 22 6 

Qy 225 SPLSAASFKEHEYLGNLSTVLPTEGTLQENVSEASKEVSEKAKTLLIDRDLTEFSELEYS 284 

MM Mill MMM I :|lh:| i hi -III IIIIIIII 

Db 227 SPLSTVSFKEHGYLGNLSAVSSSEGTIEETLNEASKELPERATNPFVNRDLAEFSELEYS 286 

Qy 285 EMGSSFSVSPKAESAVI VANPREEI IVKNKDEEEKLVSNNILHXQQELPTALTKLVKEDE 344 

llllll Ml I I I ^ : I I H I : I I : : I I : I : II = || || I Ml 
Db 287 EMGSSFKGSPKGESAILVENTKEEVIVRSKDKED-LVCSAALHSPQESPVG KEDR 340 

Qy 345 WSSEKAKDSFNEKRVAVEAPMREEYADFKPFERVWEVKDSKEDS-DMLAAGGKIESNLE 403 

Ml II I III ::M I M II II I II I II M Mllh I I hill MM 
Db 341 WSPEKTMDIFNEMQMSWAPVREEYADFKPFEQAWEVKDTYEGSRDVLAA RANVE 396 

Qy 404 SKVDKKCFADSLEQTNHEKDSESSNDDTSFPSTPEGIKDRSGAYITCAPFNPAATESIAT 463 

Mlhll Mill = MM hi 1 1 1 J I J I Ml I MMM I Mill 

Db 3 97 SKVDRKCLEDSLEQKSLGKDSEGRNEDASFPSTPEPVKDSSRAYITCASFT-SATESTTA 455 

Qy 4 64 NIFPLLEDPTSENXTDEKKIEEKKAQIVTEKNTSTKTSNPFFVAAQDSETDYVTTDNLTK 523 

I MMM MM IIIM 'hill II II MMM M MM MMM hi 

Db 456 NTFPLLEDHTSENKTDEKKIEERKAQI ITEK-TSPKTSNPFLVAVQDSEADYVTTDTLSK 514 



Qy 


524 


VTEEWANMPEGLTPDLVQEACESELNEVTGTKIAYETKMDLVQTSEVMQESLYPAAQLC 
Ml 1 ' 1 1 1 1 1 1 1 I 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 : 1 1 1 1 1 1 1 : 1 1 1 1 1 1 1 1 1 1 


583 


Db 


515 


I I I I I I I 1 i 1 I I I 1 1 1 1 1 1 1 1 1 I 1 1 II ! 1 1 M II II M 1 * II 1 1 1 1 IMI 

VTEAAVSNMPEGLTPDLVQEACESELNEATGTKIAYETKVDLVQTSEAIQESLYPTAQLC 


574 


Qy 


584 


PSFEESEATPSPVLPDIVMEAPLNSAVPSAGASVIQPSSSPLEA-SSVNYESIKHEPENP 


642 


Db 


575 


| | | | | : | | | | | | | | | | | | | | | | | | | : | | | | | M : | | | | | | | | : | : | | j | | | | | 
PSFEEAEATPSPVLPDI VMEAPLNSLLPSAGASWQPSVS PLEAPPPVS YDS I KLEPENP 


634 


Qy 


643 


PPYEEAMSVSLKVSGI KEEI KEPENI NAALQETEAPYI S IACDLI KETKLSAEPAPDFSD 


702 


Db 


635 


| | | | | | | : | : | | | || | | | | | : I | H | | ! 1 1 I 1 1 1 1 1 1 1 1 I I | I || | | : | | | | : 
PPYEEAMNVALKALGTKEGIKEPESFNAAVQETEAPYISIACDLIKETKLSTEPSPDFSN 


694 


Qy 


703 


YSEMAKVEQPVPDHSELVEDSSPDSEPVDLFSDDSI PDVPQKQDETVMLVKESLTETSFE 


762 


Db 


695 


| | | : | | | : | h | : | M 1 1 1 1 h 1 1 1 1 1 1 1 1 1 1 1 1 h 1 M hi I H 1 1 1 1 1 1 1 

YSEIAKFEKSVPEHAELVEDSSPESEPVDLFSDDSIPEVPQTQEEAVMLMKESLTEVS-E 


753 


Qy 


763 


SMI EYENKEKLSALPPEGGKPYLESFKLSLDNTKDTLLPDEVSTLSKKEKI PLQMEELST 


822 


Db 


754 


: : : : : : | : | | | | | | | | | | | | | : : | : | | | : : | | : | | | | | | | | | | : | 
TVAQHK-EERLSASPQELGKPYLESFQPNLHSTKDA-ASNDI PTLTKKEKISLQMEEFNT 


811 


Qy 


823 


AVYSNDDLFISKEAQIRETETFSDSSPIEI IDEFPTLISSKTDSFSKIjAREYTDLEVSHK 


882 


Db 


812 


| : | | | | | | Ml : | : | : | | | M 1 1 1 1 1 1 1 1 1 1 1 1 '\ • 1 M 1 1 M 1 1 1 1 1 1 I I 
AIYSNDDLLSSKEDKIKESETFSDSSPIEIIDEFPTFVSAKDDS-PKLAKEYTDLEVSDK 


870 


Qy 


883 


SEIANAPDGAGSLPCTELPHDLSLKNIQPKVEEKISFSDDFSKNGSATSKVLLLPPDVSA 
1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 II | | : | | : j | : | | : | : | | | 


942 


Db 


871 


SEIANIQSGADSLPCLELPCDLSFKNIYPK- -DEVHVSDEFSENRSSVSKASISPSNVSA 


928 


Qy 


943 


LGHTQAEIESI VKPKVLEKEAEKKLPSDTEKEDRSPSAIFSADLGKTSVVDLLYWRDIKK 


1002 


Db 


929 


1 1 1 : 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 i 1 h 1 h I 1 1 1 1 1 1 1 1 I 1 1 1 1 II 

L-EPQTEMGS I VKSKSLTKEAEKKLPSDTEKEDRSLSAVLSAELSKTS WDLLYWRDI KK 


987 


Qy 


1003 


TGWFGASLFLLLSLTVFS I VS VTAYI AIALLS VTI SFRI YKGVI QAI QKSDEGHPFRAY 

M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 II M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 


1062 


Db 


988 


TGWFGASLFLLLSLTVFS I VSVTAY I ALALLSVTI SFRI YKGVI QAI QKSDEGHPFRAY 


1047 


Qy 


1063 


LESEVAI SEELVQKYSNSALGHVNCTI KELRRLFLVDDLVDSLKFAVLMWVFTYVGALFN 


1122 


Db 


1048 


1 M 1 1 1 1 1 1 1 1 1 1 M 1 II 1 !il i IN II 1 II II 1 1! Ill 1 1 1 il 1 1 1 1 

LESEVAI SEEL VQKYSNSALGHVNST I KELRRLFLVDDLVDSLKFAVLMWVFTYVGALFN 


1107 


Qy 


1123 


GLTLLI LALI SLFS VP VI YERHQAQI DHYLGLANKNVKDAMAKI QAKI PGLKRKAE 1178 

IIIMIMIIIIIhllllllM llllllllllhlllMlllllllllllllh 




Db 


1108 


GLTLLI LAL I SLFS I P VI YERHQVQ I DHYLGLANKS VKDAMAKI QAKI PGLKRKAD 1163 





RESULT 13 
AAY71557 

ID AAY71557 standard; Protein; 1162 AA. 
XX 



AC AAY71557; 
XX 

DT 02-NOV-2000 (first entry) 
XX 

DE Rat Nogo A truncated protein used in the construction of mutant Nogo-A. 
XX 

KW Rat; neurite growth inhibitor; Nogo A; neural cell; myelin; CNS; 

KW central nervous system; neoplastic disease; antiproliferative; glioma; 



KW antisense gene therapy; neuroblastoma; menagioma; retinoblastoma; 

KW degenerative nerve disease; Alzheimer's disease; Parkinson's disease; 

KW hyperprol iterative disorder; benign dysprol iterative disorder; diagnosis; 

KW psoriasis; tissue hypertrophy; neuronal regeneration; treatment ; 

KW structural plasticity; screening; mutant; mutein. 

XX 

OS Rattus sp. 
XX 

PN WO2 00031235-A2 . 
XX 

PD 02-JUN-2000. 
XX 

PF 05-NOV-1999; 9 9WO-US26160 . 
XX 

PR 06-NOV-1998; 98US-0107446 . 
XX 

PA (SCHW/) SCHWAB ME. 

PA (CHEN/) CHEN M S. 
XX 

PI Schwab ME, Chen MS; 
XX 

DR WPI; 2000-400052/34, 
XX 

PT Nogo proteins and nucleic acids useful for treating neoplastic 

PT disorders of the central nervous system and inducing regeneration of 

PT neurons - 

XX 

PS Example; Page - ; 122pp; English. 
XX 

CC The patent relates to neurite growth inhibitor Nogo which is free of 

CC all central nervous system (CNS) myelin material with which it is 

CC natively associated. Nogo proteins and fragments displaying neurite 

CC growth inhibitory activity are used in the treatment of neoplastic 

CC disease of the CNS e.g. glioma, glioblastoma, medulloblastoma , 

CC craniopharyngioma, ependyoma, pinealoma, haemangioblastoma , acoustic 

CC neuroma, oligodendroglioma, menagioma, neuroblastoma or retinoblastoma 

CC and degenerative nerve diseases e.g. Alzheimer's and Parkinson's 

CC diseases. Therapeutics which promote Nogo activity can be used to treat 

CC or prevent hyperprolif erat ive or benign dysprol iterative disorders e.g. 

CC psoriasis and tissue hypertrophy. Ribozymes or antisense Nogo nucleic 

CC acids can be used to inhibit production of Nogo protein to induce 

CC regeneration of neurons or to promote structural plasticity of the CNS 

CC in disorders where neurite growth, regeneration or maintenance are 

CC deficient or desired. The animal models can be used in diagnostic and 

CC screening methods for predisposition to disorders and to screen for or 

CC test molecules which can treat or prevent disorders or diseases of the 

CC CNS . The present sequence is a truncated form of rat Nogo A protein shown 

CC in AAY71310, which is used in the construction of mutant Nogo-A. Nogo-A 

CC is composed of His-tag/T7-tag/vector/Nogo-A sequence aa 1-1162. 

CC Nogo A deletion mutants were used for mapping the inhibitory sites of 

CC Nogo protein. Major inhibitory region was identified in the 

CC Nogo A sequence from amino acids 172-974, particularly amino acids 

CC 542-722. In addition, N-terminal region 1-171 was found to be inhibitory 

CC to NIH 3T3 fibroblast spreading. 

CC Note: The present sequence is not given in the specification but is 

CC derived from rat Nogo A sequence shown in AAY71310. SEQ ID numbers 35-42 

CC are referred in claim 32 and SEQ ID NO: 29 in disclosure of the 



CC specification. However, the specification does not include sequences for 

CC these SEQ ID numbers. 

XX 

SQ Sequence 1162 AA; 

Query Match 72.5%; Score 4294.5; DB 21; Length 1162; 

Best Local Similarity 74.1%; Pred. No. 6.1e-215; 

Matches 885; Conservative 103; Mismatches 156; Indels 51; Gaps 19; 
Qy 1 MEDLDQSPLVSSS-DSPPRPQPAFKYQFVREPEDEE-EEEEEEEEDEDEDLEELEVLERK 58 

Mhlli Mill Illlll llllllll Illlll MMhlhMMMMMMM 

Db 1 MEDIDQSSLVSSSTDSPPRPPPAFKYQFVTEPEDEEDEEEEEDEEEDDEDLEELEVLERK 60 

Qy 59 PAAGLSAAPVPTAPAAGAPLMDFGNDFVP PAPRGPLPAAP P VAPERQPSWDPS PVSSTVP 118 

llllllll M III llhll :| I. Illllllh II : I 

Db 61 PAAGLSAAAVP- -PAAAAPLLDFSSDSVPPAPRGPLPAAPPAAPERQPSWERSPAA P 115 

QY 119 APSPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPVWTPPAPAPAAPPSTPAAPKR 178 

Ml 1 1 1 1 IIIIIIIIIIIIIIIIIIM II II Mill 

Db 116 APSLPPAAAVLPSKLPEDDEPPARPPPPPPAGASPLAE PAAPPSTPAAPKR 166 

Qy 17 9 RGSSG AWXXXXKIMDLKEQPGNTISAGQEDFPSVLLETAASXPSL 224 

Ml ^ Mill 1 1 1 M h M II I II II II 1 1 1 II I III 

Db 167 RGSGSVDETLFALPAASEPVIPSSAEKIMDLMEQPGNTVSSGQEDFPSVLLETAASLPSL 22 6 

Qy 225 SPLSAASFKEHEYLGNLSTVLPTEGTLQENVSEASKEVSEKAKTLLIDRDLTEFSELEYS 284 

MM Mill Illlll I :|||::| ::|Mlh hi ::||| llllllll 

Db 227 SPLSTVSFKEHGYLGNLSAVSSSEGTIEETLNEASKELPERATNPFVNRDLAEFSELEYS 286 

Qy 285 EMGSSFSVS PKAESAVI VANPREEI I VKNKDEEEKLVSNNI LHXQQELPTALTKLVKEDE 344 

MIMI III II M M I : I I : I I : : I I : I : II = II II I 
Db 287 EMGSS FKGS PKGESA I LVENTKEE VI VRSKDKED - LVCSAALHS PQES PVG KEDR 340 

Qy 345 WSSEKAKDSFNEKRVAVEAPMREEYADFKPFERWEVKDSKEDS-DMLAAGGKIESNLE 4 03 

Ml M I III : = = | 1 1 = 1 1 1 1 1 1 M 1 1 1 ^ Mllh I I hill MM 

Db 341 WSPEKTMDIFNEMQMSWAPVREEYADFKPFEQAWEVKDTYEGSRDVLAA RANVE 3 96 

Qy 404 SKVDKKCFADSLEQTNHEKDSESSNDDTSFPSTPEGIKDRSGAYITCAPFNPAATESIAT 463 

Nihil Mill : MM hi lllllll Ml I Illlll I Mill 

Db 397 SKVDRKCIEDSLEQKSLGKDSEGRNEDASFPSTPEPVKDSSRAYITCASFT-SATESTTA 455 

Qy 464 NIFPLLEDPTSENXTDEKKIEEKKAQIVTEKNTSTKTSNPFFVAAQDSETDYVTTDNLTK 523 

I MINI MM MMMIhlllhlM II MIMI II MM Illlll hi 

Db 456 NTFPLLEDHTSENKTDEKKIEERKAQI ITEK-TSPKTSNPFLVAVQDSEADYVTTDTLSK 514 

Qy 524 VTEEWANMPEGLTPDLVQEACESELNEVTGTKIAYETKMDLVQTSEVMQESLYPAAQLC 583 

Mi h I M 1 1 II I II M II I II II 1 1 1 1 1 i 1 1 1 1 1 1 - 1 [ 1 1 1 1 1 M 1 1 II I MM 

Db 515 VTEAAVSNMPEGLTPDLVQEACESELNEATGTKIAYETKVDLVQTSEAIQESLYPTAQLC 574 

Qy 584 PSFEESEATPSPVLPDIVMEAPLNSAVPSAGASVIQPSSSPLEA-SSVNYESIKHEPENP 642 

1 1 1 1 1 = f 1 1 1 1 M M 1 1 1 1 M 1 1 M MIIMMMM Mill hhlll Mill 

Db 575 PSFEEAEATPSPVLPDIVMEAPLNSLLPSAGASWQPSVSPLEAPPPVSYDSIKLEPENP 634 

Qy 643 PPYEEAMSVSLKVSGIKEEIKEPENINAALQETEAPYISIACDLIKETKLSAEPAPDFSD 702 

I I I I | || IIMh I I I I I I 'I IhlMh 

°t> 635 PPYEEAMNVALKALGTKEGIKEPESFNAAVQETEAPYISIACDLIKETKLSTEPSPDFSN 694 



Qy 


703 


YSEMAKVEQPVPDHSELVEDSS PDSEPVDLFSDDS I PDVPQKQDETVMLVKESLTETSFE 762 
1 1 1 • 1 1 1 . 1 1 - 1 - 1 II 1 1 1 1 1 1 II 1 1 1 I 1 1 1 II Mil j 


Db 


695 


i i i • i i i • i l * l • M ll M 11 * 1 II 1 II 1 II II II ■ 1 II 1 * 1 1 1 I : 1 1 1 1 1 I 1 1 
YSE I AKFEKSVPEHAELVEDSSPESEPVDLFSDDS I PEVPQTQEEAVMLMKESLTEVS - E 


753 


Qy 


763 


SMI EYENKEKLSALPPEGGKPYLESFKLSLDNTKDTLLPDEVSTLSKKEKI PLQMEELST 822 
1 . 1 1 1 1 1 1 1 1 1 1 1 1 1 . .1 .mi i i i i i i i i i i i i i 


Db 


754 


1 ■ * -1 -III : : : 1 1 : 1 1 1 1 1 1 1 1 1 1 : 1 

TVAQHK-EERLSASPQELGKPYLESFQPNLHSTfCDA-ASNDIPTLTKKEKISLQMEEFNT 


811 


Qy 


823 


AVYSNDDLFISKEAQIRETETFSDSSPIEI IDEFPTLISSKTDSFSKLAREYTDLEVSHK 

-II ill 1 1 I 1 1 || iihiiiiiiii | 

AI YSNDDLLSSKEDKI KESETFSDSSPI EI IDEFPTFVSAKDDS - PKLAKEYTDLEVSDK 


882 


Db 


812 


870 


Qy 


883 


SEIANAPDGAGSLPCTELPHDLSLKNIQPKVEEKISFSDDFSKNGSATSKVLLLPPDVSA 


942 


Db 


871 


Mill 11 Illl III III III II ::: ! : | 1 

SEIANIQSGADSLPCLELPCDLSFKNI YPK- -DEVHVSDEFSENRSSVSKASISPSNVSA 


928 


Qy 


943 


LGHTQAEIESIVKPKVLEKEAEKKLPSDTEKEDRSPSAIFSADLGKTSWDLLYWRDIKK 


1002 


Db 


929 


| | | : M 1 1 1 1 lllllllllllllllll | | : Ihl 1 III 1 1 1 1 I 1 1 1 1 1 1 
L-EPQTEMGSIVKSKSLTKEAEKKLPSDTEKEDRSLSAVLSAELSKTSWDLLYWRDIKK 


987 


Qy 


1003 


TGWFGASLFLLLSLTVFS I VSVTAYI ALALLSVTI SFRI YKGVI QAI QKSDEGHPFRAY 


1062 


Db 


988 


MIIIIMIIIIIIIIMIIIIIIMIIMIIIIIMIIMMIIIIMIIIIMMIII 

TGWFGASLFLLLSLTVFS I VSVTAYI ALALLSVTI SFRI YKGVI QAI QKSDEGHPFRAY 


1047 


Qy 


1063 


LESEVAI SEELVQKYSNSALGHVNCTI KELRRLFLVDDLVDSLKFAVLMWVFTYVGALFN 


1122 


Db 


1048 


1 1 1 M 1 II 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1| || 1 1 1 1 1 1 1 1 

LESEVAI SEELVQKYSNSALGHVNSTI KELRRLFLVDDLVDSLKFAVLMWVFTYVGALFN 


1107 


Qy 


1123 


GLTLLI LALI SLFS VPVI YERHQAQ I DH YLGLANKNVKDAMAKI QAKI PGLKRKA 1177 

IMIIIIIIIIIIhllllllll llllllllllhllMIIIIIIIIMIMII 




Db 


1108 


GLTLLI LALI SLFS I PVI YERHQVQ I DH YLGLANKS VKDAMAKI QAKI PGLKRKA 1162 





RESULT 14 
AAY71384 

ID AAY71384 standard; Protein; 1163 AA. 
XX 

AC AAY71384; 
XX 

DT 02-NOV-2000 (first entry) 
XX 
DE 
XX 
KW 
KW 
KW 
KW 
KW 
KW 



Alternative version of rat neurite growth inhibitor Nogo A. 



Rat; neurite growth inhibitor; Nogo A; neural cell; myelin; CNS; 
central nervous system; neoplastic disease; antiproliferative; glioma; 
antisense gene therapy; neuroblastoma; menagioma; retinoblastoma; 
degenerative nerve disease; Alzheimer's disease; Parkinson's disease; 
hyperprol iterative disorder; benign dysprol if erative disorder; diagnosis; 
psoriasis; tissue hypertrophy; neuronal regeneration; treatment; 
KW structural plasticity; screening-. 
XX 

OS Rattus sp. 
XX 

FH Key Location/Qualifiers 
FT Inhibitory-site 1 . . 171 

FT /note- "Inhibits NIH 3T3 fibroblast spreading" 

FT Modif ied-site 30 



FT 






/note= 


"Casein kinase II site' 


i 


FT 


Region 




31. .58 






FT 






/note= 


"Acidic region" 




FT 


Modified 


-site 


233 






FT 






/note- 


"Protein kinase C (PKC) 


site" 


FT 


Modified 


-site 


242 . .244 




FT 






/note- 


"Asn is N-glycosylated' 




FT 


Modified 


-site 


291 






FT 






/note= 


"Protein kinase C ( PKC) 


site" 


FT 


Modif ied- 


-site 


295 






FT 






/note= 


"Protein kinase C (PKC) 


site" 


FT 


Modif ied- 


-site 


436 






FT 






/note= 


"Protein kinase C (PKC) 


site" 


FT 


Modif ied- 


-site 


468 . .470 




FT 






/note= 


"Asn is N-glycosylated' 




FT 


Modif ied- 


-site 


484 






FT 






/note= 


"Protein kinase C (PKC) 


site" 


FT 


Modif ied- 


-site 


488 






FT 






/note= 


"Protein kinase C ( PKC) 


site" 


FT 


Modif ied- 


-site 


502 






FT 






/note= 


"Casein kinase II site* 




FT 


Modif ied- 


-site 


576 






FT 






/note= 


"Casein kinase II site' 




FT 


Modif ied- 


-site 


626 






FT 






/note= 


"Protein kinase C (PKC) 


site" 


FT 


Modif ied- 


-site 


694. .696 




FT 






/note= 


"Asn is N-glycosylated" 




FT 


Modif ied- 


-site 


715 






FT 






/note= 


"Casein kinase II site" 




FT 


Modif ied- 


-site 


784 






FT 






/note= 


"Protein kinase C (PKC) 


site" 


FT 


Modif ied- 


-site 


821 






FT 






/note- 


"Protein kinase C (PKC) 


site" 


FT 


Modif ied- 


-site 


850 






FT 






/note= 


"Protein kinase C (PKC) 


site" 


FT 


Modif ied- 


-site 


855 






FT 






/note= 


"Protein kinase C (PKC) 


site" 


FT 


Modif ied- 


-site 


863 






FT 






/note= 


"Casein kinase II site" 




FT 


Modif ied- 


site 


868 






FT 






/note- 


"Protein kinase C (PKC) 


site" 


FT 


Modified- 


site 


893 






FT 






/note- 


"Protein kinase C (PKC) 


site" 


FT 


Modif ied- 


site 


912 . . 914 




FT 






/note= 


"Asn is N-glycosylated" 




FT 


Modif ied- 


site 


925 . . 927 




FT 






/note- 


"Asn is N-glycosylated" 




FT 


Modif ied- 


site 


954 






FT 






/note- 


"PKC and casein kinase 


II sites" 


FT 


Modif ied- 


site 


956 






FT 






/note- 


"PKC and casein kinase 


II sites" 


FT 


Domain 




988 . . 1023 




FT 






/label = 


Transmembrane_domain 




FT 






/note= 


"C- terminal hydrophobic 


region " 


FT 


Modif ied- 


site 


1024 






FT 






/note- 


"Protein kinase C (PKC) 


site" 


FT 


Modif ied- 


site 


1071 . . 1073 





FT /note= "Asn is N-glycosylated" 

FT Modif ied-site 1073 

FT /note= "Protein kinase C (PKC) site" 

FT Modif ied-site 1089 

FT /note= "Protein kinase C (PKC) site" 

FT Domain 1090 -.1125 

FT /label= Transmerrtbrane_domain 

FT /note= "C- terminal hydrophobic region" 

FT Modif ied-site 1141 ..1143 

FT /note= "Asn is N-glycosylated" 

FT Modif ied-site 1143 

FT /note= "Protein kinase C (PKC) site" 

FT Peptide 623 . . 640 

FT /note= "used as immunogen to generate antibody AS 472" 

FT Peptide 762 . . 1163 

FT /note= "used as immunogen to generate antibody AS Bruna 

FT Inhibitory-site 542 . . 722 

FT Region 172. .259 

FT /note= "This region is not essential for inhibitory 

FT activity" 

FT Region 975 . . 1162 

FT /note= "This region is not essential for inhibitory 

FT activity" 

FT Region 976 . . 1163 

FT /note= "C-terminal common region found in Nogo A, B and 

FT C isoforms" 

FT Misc-dif f erence 223 

FT /label= Unknown 

FT /note= "There is Leu at this position in the 

FT sequence shown in AAY71310" 

FT Misc-dif ference 404 

FT /note= "There is lie at this position in the 

FT sequence shown in AAY71310" 

FT Misc-dif ference 469 

FT /label = Unknown 

FT /note= "There is Lys at this position in the 

FT sequence shown in AAY71310" 

FT Misc-dif ference 661 

FT /note= "There is Asn at this position in the 

FT sequence shown in AAY71310" 

FT Misc-dif ference 820 

FT /note= "There is Leu at this position in the 

FT sequence shown in AAY71310" 

XX 

PN WO200031235-A2 . 
XX 

PD 02-JUN-2000. 
XX 

PF 05-NOV-1999; 99WO-US26160 . 
XX 

PR 06-NOV-1998; 98US- 010744 6 . 
XX 

PA (SCHW/) SCHWAB M E. 

PA (CHEN/) CHEN M S. 
XX 

PI Schwab ME, Chen MS; 
XX 



DR WPI; 2000-400052/34. 
XX 

PT Nogo proteins and nucleic acids useful for treating neoplastic 

PT disorders of the central nervous system and inducing regeneration of 

PT neurons - 

XX 

PS Claim 3; Fig 13; 122pp ; English. 
XX 

CC The present sequence is an alternative version of rat Nogo A protein 

CC which is a potent neural cell growth inhibitor and is free of all 

CC central nervous system (CNS) myelin material with which it is 

CC natively associated. Nogo proteins and fragments displaying 

CC neurit e growth inhibitory activity are used in the 

CC treatment of neoplastic disease of the CNS 

CC e.g. glioma, glioblastoma, medulloblastoma, craniopharyngioma, 

CC ependyoma, pinealoma, haemangioblastoma , acoustic neuroma, 

CC oligodendroglioma, menagioma, neuroblastoma or retinoblastoma and 

CC degenerative nerve diseases e.g. Alzheimer's and Parkinson's diseases. 

CC Therapeutics which promote Nogo activity can be used to treat or prevent 

CC hyperprol iterative or benign dysprol iterative disorders e.g. psoriasis 

CC and tissue hypertrophy. Ribozymes or antisense Nogo nucleic acids can be 

CC used to inhibit production of Nogo protein to induce regeneration of 

CC neurons or to promote structural plasticity of the CNS in disorders where 

CC neurite growth, regeneration or maintenance are deficient or desired. 

CC The animal models can be used in diagnostic and screening methods for 

CC predisposition to disorders and to screen for or test molecules which 

CC can treat or prevent disorders or diseases of the CNS. 

CC Note: The present sequence is an alternative version of the 

CC Nogo A sequence shown in Fig. 2A (see AAY71310) . 

CC SEQ ID numbers 35-42 are referred in claim 32 and SEQ ID NO: 29 

CC in disclosure of the specification. However the specification does not 

CC include sequences for these SEQ ID numbers. 

XX 

SQ Sequence 1163 AA; 

Query Match 72.4%; Score 4286.5; DB 21; Length 1163; 

Best Local Similarity 74.1%; Pred. No. 1.6e-214; 

Matches 886; Conservative 104; Mismatches 155; Indels 51; Gaps 19 
Qy 1 MEDLDQSPLVSSS-DSPPRPQPAFKYQFVREPEDEE-EEEEEEEEDEDEDLEELEVLERK 58 



Db 




Qy 




Db 



119 APSPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPVWTPPAPAPAAPPSTPAAPKR 178 



Db 





Qy 



179 RGSSG AWXXXXKI MDLKEQPGNTI SAGQEDFPSVLLETAASXPSL 224 



Db 



167 RGSGSVDETLFALPAASEPVT PSSAEKIMDLMEQPGNTVSSGQEDFPSVLLETAASXPSL 22 




Qy 



225 SPLSAASFKEHEYLGNLSTVLPTEGTLQENVSEASKEVSEKAKTLLI DRDLTEFSELEYS 284 



Db 


227 


SPLSTVSFKEHGYLGNLSAVSSSEGTIEETLNEASKELPERATNPFVNRDLAEFSELEYS 


286 


Qy 


285 


EMGSSFSVS PKAESAVI VANPREEI I VKNKDEEEKLVSNNI LHXQQELPTALTKLVKEDE 


344 


Db 


287 


1 1 M 1 1 1 1 1 1 1 h M 1 = 1 h 1 1 : M h 1 : 1 1 : II III 1 1 1 
EMGSSFKGS PKGESAI LVENTKEEVI VRSKDKED - LVCSAALHSPQESPVG KEDR 


340 


Qy 


345 


WSSEKAKDSFNEKRVAVEAPMREEYADFKPFERVWEVKDSKEDS -DMLAAGGKI ESNLE 


403 


Db 


341 


Ml M 1 III ^1 Ihllllllllllh Ml'- | | hill :|:| 
WSPEKTMDIFNEMQMSWAPVREEYADFKPFEQAWEVKDTYEGSRDVLAA RANVE 


396 


Qy 


404 


SKVDKKCFADSLEQTNHEKDSESSNDDTSFPSTPEGIKDRSGAYITCAPFNPAATESIAT 


463 


Db 


397 


Nihil Mill : MM hi III ! Ml 1 IIIMI I Mill 

SKVDRKCLEDSLEQKSLGKDSEGRNEDASFPSTPEPVKDSSRAYITCASFT-SATESTTA 


455 


Qy 


464 


NI FPLLEDPTSENXTDEKKI EEKKAQI VTEKNTSTKTSNPFFVAAQDSETDYVTTDNLTK 523 

1 llllll II 1 1 II 1 1 1 II 1 h 1 M h II 1 II IIIMI II MM MUM Ml 


Db 


456 


NTF PLLEDHTS ENXTDE KKI EERKAQ 1 1 TE K - TS P KTSNP FLVAVQDS EAD Y VTTDTLS K 514 


Qy 


524 


VTEEWANMPEGLTPDLVQEACESELNEVTGTKIAYETKMDLVQTSEVMQESLYPAAQLC 
III 1 - 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 1 1 1 1 1 . 1 1 1 1 1 1 1 I it i 


583 


Db 


515 


iii i - i ii ii ii M j i If 1 1 1 1 1 1 1 1 * 1 1 1 1 1 1 1 -llllll 1 1 1 1 

VTEAAVSNMPEGLTPDLVQEACESELNEATGTKIAYETKVDLVQTSEAIQESLYPTAQLC 574 


Qy 


584 


PSFEESEATPSPVLPDIVMEAPLNSAVPSAGASVIQPSSSPLEA-SSWYESIKHEPENP 

MMhMMIIIMIMMIMM Ml II II hi II Mill hhlll Mill 

PSFEEAEATPSPVLPDIVMEAPLNSLLPSAGASWQPSVSPLEAPPPVSYDSIKLEPENP 


642 


Db 


575 


634 


Qy 


643 


PPYEEAMSVSLKVSGIKEEIKEPENINAALQETEAPYISIACDLIKETKLSAEPAPDFSD 
M II II 1 : M 1 1 1 II llllh lllhlhl! 1 


702 


Db 


635 


PPYEEAMNVALKALGTKEGIKEPESFMAAVQETEAPYISIACDLIKETKLSTEPSPDFSN 


694 


Qy 


703 


YSEMAKVEQPVPDHSELVEDSSPDSEPVDLFSDDSI PDVPQKQDETVMLVKESLTETSFE 


762 


Db 


695 


Mhll h i hi IMMIIMI 1 1 

YSEIAKFEKSVPEHAELVEDSSPESEPVDLFSDDSI PEVPQTQEEAVMLMKESLTEVS-E 


753 


Qy 


763 


SMIEYENKEKLSALPPEGGKPYLESFKLSLDNTKDTLLPDEVSTLSKKEKI PLQMEELST 
1 . 1 1 1 1 1 1 1! 1 1 1 1 1 . .1 .III . . _ Mill it i 


822 


Db 


754 


I • I 1 1 I 1 1 1 * - 1 - 1 II • • • 1 1 • 1 I I 1 1 Mill : 1 

TVAQHK-EERLSASPQELGKPYLESFQPNLHSTKDA-ASNDI PTLTKKEKI SLQMEEFNT 811 


Qy 


823 


AVYSNDDLFISKEAQIRETETFSDSSPIEI IDEFPTLISSKTDSFSKLAREYTDLEVSHK 


882 


Db 


812 


IMIIMI III MMMIMIMIIIIIMIM MM II IMMMMMI 1 

AI YSNDDLSSSKEDKI KESETFSDSS PI EI I DE F PTF VSAKDDS - PKLAKEYTDLEVSDK 


870 


Qy 


883 


SEIANAPDGAGSLPCTELPHDLSLKNIQPKVEEKISFSDDFSKNGSATSKVLLLPPDVSA 


942 


Db 


871 


Mill II MM III Ml III 1! ::: MMIM h II = 1 MM 
SEIANIQSGADSLPCLELPCDLSFKNIYPK- -DEVHVSDEFSENRSSVSKASISPSNVSA 


928 


Qy 


943 


LGHTQAEI ES I VKPKVLEKEAEKKLPSDTEKEDRS PSAI FSADLGKTSWDLLYWRDI KK 

1 1 1 : MM 1 1 MIMIIIIIIIIMII lh MM II 1 I'll 

L-EPQTEMGSIVKSKSLTKEAEKKLPSDTEKEDRSLSAVLSAELSKTSWDLLYWRDIKK 


1002 


Db 


929 


987 


Qy 


1003 


TGWFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRAY 

MIMIMMMMMMMIMMMMMMIMMMMMMMIMMMIMM 

TGWFGASLFLLLSLTVFSI VSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRAY 


1062 


Db 


988 


1047 


Qy 


1063 


LESEVAI S E ELVQKYSNS ALGH VNCT I KELRRLFLVDDLVDSLKFAVLMWVFTYVGALFN 


1122 


Db 


1048 


M M M M II 1 M 1 II II 1 II II 1 M t ! 1 1 1 1 1 1 1 1 [ 1 M 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 

LESEVAI SEELVQKYSNSALGHVNST I KELRRLFLVDDLVDSLKFAVLMWVFTYVGALFN 


1107 



Qy 1123 GLTLLI LALI SLFS VPVI YERHQAQI DHYLGLANKNVKDAMAKI QAKI PGLKRKAE 1178 

lllllllllllllhllllllll 

Db 1108 GLTLLI LALI SLFS I PVI YERHQVQ I DHYLGLANKS VKDAMAKI QAKI PGLKRKAD 1163 

RESULT 15 
AAY71560 

ID AAY71560 standard; Protein; 974 AA. 
XX 

AC AAY71560; 
XX 

DT 02-NOV-2000 (first entry) 
XX 

DE Rat Nogo A protein fragment used in the construction of mutant NiAext . 
XX 

KW Rat; neurite growth inhibitor; Nogo A; neural cell; myelin; CNS; 

KW central nervous system; neoplastic disease; antiproliferative; glioma; 

KW antisense gene therapy; neuroblastoma; menagioma; retinoblastoma; 

KW degenerative nerve disease; Alzheimer's disease; Parkinson's disease; 

KW hyperprol iterative disorder; benign dysprol if erative disorder; diagnosis ; 

KW psoriasis; tissue hypertrophy; neuronal regeneration; treatment; 

KW structural plasticity; screening; mutant; mutein. 

XX 

OS Rattus sp. 
XX 

PN WO200031235-A2 . 
XX 

PD 02-JUN-2000. 
XX 

PF 05-NOV-1999; 99WO-US26160 . 
XX 

PR 06-NOV-1998; 98US- 0107446 . 
XX 

PA (SCHW/) SCHWAB M E. 

PA (CHEN/) CHEN M S. 
XX 

PI Schwab ME, Chen MS; 
XX 

DR WPI; 2000-400052/34. 
XX 

PT Nogo proteins and nucleic acids useful for treating neoplastic 

PT disorders of the central nervous system and inducing regeneration of 

PT neurons - 

XX 

PS Example; Page -; 122pp ; English. 
XX 

CC The patent relates to neurite growth inhibitor Nogo which is free of 

CC all central nervous system (CNS) myelin material with which it is 

CC natively associated. Nogo proteins and fragments displaying neurite 

CC growth inhibitory activity are used in the treatment of neoplastic 

CC disease of the CNS e.g. glioma, glioblastoma, medulloblastoma, 

CC craniopharyngioma, ependyoma , pinealoma, haemangioblastoma , acoustic 

CC neuroma, oligodendroglioma, menagioma, neuroblastoma or retinoblastoma 

CC and degenerative nerve diseases e.g. Alzheimer's and Parkinson's 

CC diseases. Therapeutics which promote Nogo activity can be used to treat 

CC or prevent hyperprol iterative or benign dysprol iterative disorders e.g. 



CC psoriasis and tissue hypertrophy. Ribozymes or antisense Nogo nucleic 

CC acids can be used to inhibit production of Nogo protein to induce 

CC regeneration of neurons or to promote structural plasticity of the CNS 

CC in disorders where neurite growth, regeneration or maintenance are 

CC deficient or desired. The animal models can be used in diagnostic and 

CC screening methods for predisposition to disorders and to screen for or 

CC test molecules which can treat or prevent disorders or diseases of the 

CC CNS. The present sequence is a fragment of rat Nogo A protein shown in 

CC AAY71310, which is used in the construction of mutant NiAext . The mutant 

CC is composed of His-tag/T7-tag/vector/Nogo-A sequence aa l-974/T7-tag. 

CC Nogo A deletion mutants were used for mapping the inhibitory sites of 

CC Nogo protein. Major inhibitory region was identified in the 

CC Nogo A sequence from amino acids 172-974, particularly amino acids 

CC 542-722. In addition, N-terminal region 1-171 was found to be inhibitory 

CC to NIH 3T3 fibroblast spreading. 

CC Note: The present sequence is not given in the specification but is 

CC derived from rat Nogo A sequence shown in AAY71310. SEQ ID numbers 35-42 

CC are referred in claim 32 and SEQ ID NO: 29 in disclosure of the 

CC specification. However, the specification does not include sequences for 

CC these SEQ ID numbers. 

XX 

SQ Sequence 974 AA; 

Query Match 57.2%; Score 3388.5; DB 21; Length 974; 

Best Local Similarity 69.6%; Pred. No. 6.2e-168; 

Matches 701; Conservative 101; Mismatches 154; Indels 51; Gaps 19; 
QY 1 MEDLDQSPLVSSS-DSPPRPQPAFKYQFVREPEDEE-EEEEEEEEDEDEDLEELEVLERK 58 

Mhlll Mill II llllllll llllll I I : I U I.I I 

Db 1 MEDIDQSSLVSSSTDSPPRPPPAFKYQFVTEPEDEEDEEEEEDEEEDDEDLEELEVLERK 60 

QY 5 9 PAAGLSAAPVPTAPAAGAPLMDFGNDFVPPAPRGPLPAAPPVAPERQPSWDPSPVSSTVP 118 

llllllll II III MhM :| MUM IMIIIIh II : I 
Db 61 PAAGLSAAAVP- -PAAAAPLLDFSSDSVPPAPRGPLPAAPPAAPERQPSWERSPAA P 115 

Qy H9 APSPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPVWTPPAPAPAAPPSTPAAPKR 178 

Ml 1 1 1 1 IIIIIIIIIIIIIIIIMII II II Illllllilllll 

Db 116 APSLPPAAAVLPSKLPEDDEPPARPPPPPPAGASPLAE PAAPPSTPAAPKR 166 

Qy 179 RGSSG AWXXXXKIMDLKEQPGNTISAGQEDFPSVLLETAASXPSL 224 

Ml : Mill MlllhhlllllMIIIIIIM III 

Db 167 RGSGSVDETLFALPAASEPVIPSSAEKIMDLMEQPGNTVSSGQEDFPSVljLETAASLPSL 226 

Qy 22 5 SPLSAASFKEHEYLGNLSTVljPTEGTLQENVSEASKEVSEKAKTLLIDRDLTEFSELEYS 284 

MM Mill IIMM I : I I I - I -MUM hi -III IMIIMI 

Db 227 SPLSTVSFKEHGYLGNLSAVSSSEGTIEETLNEASKELPERATNPFVNRDLAEFSELEYS 286 

Qy 285 EMGSS FS VS PKAESAVI VAN P REE 1 1 VKNKDEEEKL VSNN I LHXQQELPTALTKLVKEDE 344 

IIMM III MhM I MhlhMhh M = II II I III 

Db 287 EMGSS FKGSPKGESAILVENTKEE VI VRSKDKED-LVCSAALHSPQESPVG KEDR 340 

Qy 34 5 WSSEKAKDSFNEKRVAVEAPMREE YADFKPFERVWEVKDSKEDS - DMLAAGGKI ESNLE 4 03 

III M I III : = :| . I I, M HIM: | | |:||| [ 
Db 341 WSPEKTMDIFNEMQMSWAPVREEYADFKPFEQAWEVKDTYEGSRDVLAA RANVE 396 

Qy 4 04 SKVDKKCFADSLEQTNHEKDSESSNDDTSFPSTPEGIKDRSGAYITCAPFNPAATESIAT 463 

Mlhll Mill : MM hi MMMI Ml I IIMM I Mill 



Db 3 97 SKVDRKCIEDSLEQKSLGKDSEGRNEDASFPSTPEPVKDSSRAYITCASFT-SATESTTA 4 55 

Qy 464 NI FPLLEDPTSENXTDEKKI EEKKAQI VTEKNTSTKTSNPFFVAAQDSETDYVTTDNLTK 523 

I 1 1 1 1 1 1 1 1 1 1 Ml Ml I I'M || IIIUI || 1 1 1 1 MINI hi 

Db 456 NTFPLLEDHTSENKTDEKKIEERKAQI ITEK-TSPKTSNPFLVAVQDSEADYVTTDTLSK 514 

Qy 524 VTEEWANMPEGLTPDLVQEACESELNEVTGTKIAYETKMDLVQTSEVMQESLYPAAQLC 583 

III hi M I ! 1 1 IN: ; 1 1 II 1 1 1 1 1 1 II; I ! 1 1 ^ 1 1 1 1 1 1 1 : 1 1 1 1 1 1 1 1 1 1 

Db 515 VTEAAVSNMPEGLTPDLVQEACESELNEATGTKIAYETKVDLVQTSEAIQESLYPTAQLC 574 

Qy 584 PSFEESEATPSPVLPDIVMEAPLNSAVPSAGASVIQPSSSPLEA-SSVNYESIKHEPENP 642 

IIIIIMMIIIIMIIIIIMIM :|lllllhlll Mill -HI Mill 

Db 575 PSFEEAEATPSPVLPDIVMEAPLNSLLPSAGASWQPSVSPLEAPPPVSYDSIKLEPENP 634 

Qy 643 PPYEEAMSVSLKVSGI KEEI KEPENINAALQETEAPYI SIACDLI KETKLSAEPAPDFSD 702 

MMIMMMI I II M II h II h M M 1 1 1 II II M II 1 1 1 1 II I h I II h 

Db 635 PPYEEAMNVALKALGTKEGIKEPESFNAAVQETEAPYI SIACDLI KETKLSTEPSPDFSN 694 

Qy 703 YSEMAKVEQPVPDHSELVEDSSPDSEPVDLFSDDSIPDVPQKQDETVMLVKESLTETSFE 762 

MM I 1 1 ^ I = 1 1 1 1 1 1 1 1 = 1 1 1 f 1 1 1 1 1 1 1 ! I = 1 1 1 hi IMMIIMI I I 

Db 695 YSEIAKFEKSVPEHAELVEDSSPESEPVDLFSDDSI PEVPQTQEEAVMLMKESLTEVS-E 753 

Qy 763 SMI EYENKEKLSALPPEGGKPYLESFKLSLDNTKDTLLPDEVSTLSKKEKI PLQMEELST 822 

- = = = =1=111 I I Mlllllh M MM IIMIMI Mill M 

Db 754 TVAQHK-EERLSASPQELGKPYLESFQPNLHSTKDA-ASNDI PTLTKKEKISLQMEEFNT 811 

Qy 823 AVYSNDDLFISKEAQIRETETFSDSSPIEI IDEFPTLISSKTDSFSKLAREYTDLEVSHK 882 

hllllll III ' I Ml I I I I || Mill I 

Db 812 AI YSNDDLLSSKEDKI KESETFSDSSPI EI I DEFPTFVSAKDDS - PKLAKEYTDLEVSDK 870 

Qy 8 83 SEIANAPDGAGSLPCTELPHDLSLKNIQPKVEEKISFSDDFSKNGSATSKVLLLPPDVSA 942 

Mill II MM III Ml III || ::: i H H h || : | MM 

Db 871 SEIANIQSGADSLPCLELPCDLSFKNI YPK- -DEVHVSDEFSENRSSVSKASISPSNVSA 928 

Qy 943 LGHTQAEI ES I VKPKVLEKEAEKKLPSDTEKEDRS PSA I FSADLGKT 989 

I I h MM I I 1 1 1 1 1 1 1 : 1 1 1 ! 1 1 1 Ih MM II 

Db 929 L-EPQTEMGSI VKSKSLTKEAEKKLPSDTEKEDRSLSAVLSAELSKT 974 



Search completed: January 22, 2004, 16:36:36 
Job time : 84.555 sees 



GenCore version 5.1.6 
Copyright (c) 1993 - 2004 Compugen Ltd. 



CM protein - protein search, using sw model 



Run on: 



Title: 

Perfect score: 
Sequence : 

Scoring table: 



January 22, 2004, 16:31:15 ; Search time 28.5792 Seconds 

(without alignments) 
1744.001 Million cell updates/sec 

US-09-830-972-29 
5923 

1 MEDLDQSPLVSSSDSPPRPQ VKDAMAKI QAKI PGLKRKAE 1178 

BLOSUM62 

Gapop 10.0 , Gapext 0.5 



Searched: 328717 seqs, 42310858 residues 

Total number of hits satisfying chosen parameters: 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 



328717 



Database : Issued_Patents_AA: * 

1 : /cgn2_6/ptodata/l/iaa/5A_COMB.pep : * 

2 : /cgn2_6/ptodata/l/iaa/5B_COMB.pep: * 

3 : /cgn2_6/ptodata/l/iaa/6A_COMB.pep: * 

4 : /cgn2_6/ptodata/l/iaa/6B_COMB.pep:* 

5 : / cgn2_6 /p t oda t a / 1 / iaa / PCTUS_COMB . pep : * 

6 : /cgn2_6/ptodata/l/iaa/backf ilesl .pep : * 

Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 
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ALIGNMENTS 



RESULT 1 
US-08-700-607-1 

; Sequence 1, Application US/08700607 

; Patent No. 5858708 

; GENERAL INFORMATION: 

APPLICANT: Bandman, Olga 

APPLICANT: Au-Young, Janice 

APPLICANT: Goli, Surya K. 

APPLICANT: Hillman, Jennifer L. 

TITLE OF INVENTION: TWO NOVEL HUMAN NSP-LIKE PROTEINS 
NUMBER OF SEQUENCES: 9 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: Incyte Pharmaceuticals, Inc. 

STREET: 3174 Porter Drive 

CITY: Palo Alto 

STATE: CA 

COUNTRY : U.S. 

ZIP: 94304 



COMPUTER READABLE FORM: 
MEDIUM TYPE: Diskette 
COMPUTER: IBM Compatible 
OPERATING SYSTEM: DOS 
SOFTWARE: FastSEQ Version 1.5 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/08/700 , 607 
FILING DATE: Filed Herewith 
ATTORNEY/AGENT INFORMATION: 
NAME: Billings, Lucy J. 
REGISTRATION NUMBER: 36,749 
REFERENCE/DOCKET NUMBER: PF-0114 US 
TELECOMMUNICATION INFORMATION: 
TELEPHONE: 415-855-0555 
TELEFAX: 415-845-4166 
INFORMATION FOR SEQ ID NO: 1: 
SEQUENCE CHARACTERI STI CS : 
LENGTH: 199 amino acids 
TYPE: amino acid 
STRANDEDNESS : s ingl e 
TOPOLOGY: linear 
MOLECULE TYPE: peptide 
IMMEDIATE SOURCE: 
LIBRARY: 

CLONE: Consensus 
US-08-700-607-1 

Query Match 15.7%; Score 931; DB 2; Length 199; 

Best Local Similarity 99.0%; Pred. No. 6.9e-50; 

Matches 18 9; Conservative 0; Mismatches 2; Indels 0; Gaps 0; 
Qy 988 KTSWDLLYWRDIKKTGWFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVI 1047 

I MIIIIIIIIIIMIIIIIIIIIIIIIIIIMIIIIMIMMMIIMIIIIIMI 

Db 9 KDKvVDLLYWRDIKKTGWFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVI 68 

Qy 104 8 QA I QKSDEGHPFRAYLESEVAI SEELVQKYSNSALGHVNCTI KELRRLFLVDDLVDSLKF 1107 

IMIIIMIIIMIIIIIIIMMIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIII 

Db 69 QAI QKSDEGHPFRAYLESEVAI SEELVQKYSNSALGHVNCTI KELRRLFLVDDLVDSLKF 128 

Qy 1108 AVLMWVFTYVGALFNGLTLLILALI SLFSVPVI YERHQAQI DHYLGLANKNVKDAMAKIQ 1167 

I I M I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I || I I I I I I | I || | | | | | M 
Db 129 AVLMWVFTYVGALFNGLTLLILALISLFSVPVIYERHQAQIDHYLGLANKNVKDAMAKIQ 188 

Qy 1168 AKI PGLKRKAE 1178 

Illllllllll 

Db 18 9 AKI PGLKRKAE 199 



RESULT 2 
US-08-700-607-5 

Sequence 5, Application US/08700607 
Patent No. 5858708 
GENERAL INFORMATION: 

APPLICANT: Bandman, Olga 
APPLICANT: Au- Young, Janice 
APPLICANT: Goli, Surya K. 
APPLICANT: Hillman, Jennifer L. 



TITLE OF INVENTION: TWO NOVEL HUMAN NSP-LIKE PROTEINS 
NUMBER OF SEQUENCES: 9 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: Incyte Pharmaceuticals, Inc. 

STREET: 3174 Porter Drive 

CITY: Palo Alto 

STATE : CA 

COUNTRY : U.S. 

ZIP : 94304 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Diskette 

COMPUTER: IBM Compatible 

OPERATING SYSTEM: DOS 

SOFTWARE: FastSEQ Version 1.5 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/08/700 , 607 

FILING DATE: Filed Herewith 
ATTORNEY/AGENT INFORMATION: 

NAME: Billings, Lucy J. 

REGISTRATION NUMBER: 36,749 

REFERENCE/DOCKET NUMBER: PF-0114 US 
TELECOMMUNICATION INFORMATION: 

TELEPHONE: 415-855-0555 

TELEFAX: 415-845-4166 
INFORMATION FOR SEQ ID NO: 5: 
SEQUENCE CHARACTERISTICS: 

LENGTH: 776 amino acids 

TYPE: amino acid 

STRANDEDNESS : single 

TOPOLOGY: linear 
MOLECULE TYPE: peptide 
IMMEDIATE SOURCE: 

LIBRARY: GenBank 

CLONE: 3 073 07 
US-08-700-607-5 

Query Match 13.3%; Score 787.5; DB 2; Length 776; 

Best Local Similarity 31.9%; Pred. No. 3.1e-40; 

Matches 229; Conservative 91; Mismatches 190; Indels 207; Gaps 25; 

Qy 588 ESEATPSPVLPDI - - VMEAPLNSA VPSAGASVIQPSSSPLE ASSVNY 632 

I II I III' = I h h I : : Ih I : | 

Db 141 EELGTPGPSLPDVPGIESRGLFSSDSGIEMTPAESTEVNKILADPLDQMKAEAYKYIDIT 200 

QY 633 - -ESIKHEPENPPPYEEA MSVSLKVSGIKEEIKEPENINAAL QET 675 

I 'Ih =: I h :hl |::| I | : : : | 

°b 201 RPEEVKHQEQHHPELEDKDLDFKNKDTDI S I KPEGVREPDK- PAPVEGKI I KDHLLEEST 259 

Qy 676 EAPYISIACDL 1 KETKLSAEPAPDFSDYSEMAKVEQPVPDHSELVED 722 

MM II : I I Ih : - h :: 

Db 260 FAPYID DLSEEQRRAPQ I TTPVKI TLTE I E PSVE TTTQEKTPEKQDI CLK 309 

Qy 723 SSPDSEPVDLFS DDS IPDVPQKQ DET VMLVKES LTETS F 761 

MM I I III | : | M : Mh 
Db 310 PSPDTVPTVTVSEPEDDSPGSITPPSSGTEPSAAESQGKGSISEDELITAIKEA 363 



Qy 



762 ESMI EYENKEKLSALPPEGGKPYLESFKLSLDNTKDTLLPDEVSTLSKKEKI PLQMEELS 821 



Db 


364 


Qy 


822 


Db 


404 


Qy 


869 


Db 


445 


Qy 


922 


Db 


497 


Qy 


957 


Db 


552 


Qy 


1002 


Db 


600 


Qy 


1062 


Db 


660 


Qy 


1122 


Db 


720 



II I II :: : 

- E VKARS GPPTIPSPLDH EA 4 03 



I I I lh: II : I II 



II : I 



922 DF SKNGSATSKVLL LPPDVSALGHTQAE I ES I VKP 956 

hi | Mill I I : : | 

REETGVRAEERAPSRRGLAEPGSFLDYPSTEPQPGPELPPGDGAL EPETPMLP 551 

LEKEAEKKLiPSDTEKEDRS PSA 1 FSADLGKTS WDLLYWRDI K 1001 

: I : -ihl =| | | :||||||||| 



— _ — - - — ' — - ^ - ------ - i-i-ij-n^ v j. j. uj, ivi j. ivvj u -L y^jn. j. i_/i-jvji l it J. rvrt. 

MHIh I II III i I hill || MINIM hlhlhlllllhl 

}TG I VFGS FLLLLF S LTQFS WS WAYLALAALSAT I S FR I YKS VLQAVQKTDEGHP FKA 

fLESEVAI SEELVQKYSNSALGHVNCTI KELRRLFLVDDLVDSLKFAVLMWVFTYVGALF 

II 1 = = 1 = 1 Hlh: :|| hlllll|||| | | | | | | | | | | | || : ■ 

'LELEITLSQEQI QKYTDCLQFYWSTLKELRRLFLVQDLVDSLKFAVLMWLLTYVGALF 

IGLTLLILALISLFSVPVI YERHQAQIDHYLGLANKNVKDAMAKIQAKI PGLKRKAE 11' 

IIMhM-hhMhl MMIM MM - MIMIMM II II 



RESULT 3 
US-08-700-607-6 

Sequence 6, Application US/08700607 
Patent No. 5858708 
GENERAL INFORMATION: 

APPLICANT: Bandman, Olga 

Au- Young, Janice 
Goli, Surya K. 
Hillman, Jennifer L. 
TITLE OF INVENTION: TWO NOVEL HUMAN NSP-LIKE PROTEINS 
NUMBER OF SEQUENCES: 9 
CORRESPONDENCE ADDRESS : 

ADDRESSEE: Incyte Pharmaceuticals, Inc. 
STREET: 3174 Porter Drive 
CITY: Palo Alto 
STATE : CA 
COUNTRY: U.S. 
ZIP: 94304 
COMPUTER READABLE FORM: 
MEDIUM TYPE: Diskette 
COMPUTER: IBM Compatible 
OPERATING SYSTEM: DOS 
SOFTWARE: FastSEQ Version 1.5 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/08 /700 , 607 



APPLICANT 
APPLICANT 
APPLICANT 



FILING DATE: Filed Herewith 
ATTORNEY/ AGENT INFORMATION: 
NAME: Billings, Lucy J. 
REGISTRATION NUMBER: 36,74 9 
REFERENCE/DOCKET NUMBER : PF-0114 US 
TELECOMMUNICATION INFORMATION: 
TELEPHONE: 415-855-0555 
TELEFAX: 415-845-4166 
INFORMATION FOR SEQ ID NO: 6: 
SEQUENCE CHARACTERISTICS: 
LENGTH: 356 amino acids 
TYPE: amino acid 
STRANDEDNESS : s ingl e 
TOPOLOGY: linear 
MOLECULE TYPE: peptide 
IMMEDIATE SOURCE: 
LIBRARY: GenBank 
CLONE: 307309 
US-08-700-607-6 

Query Match 11.9%; Score 705; DB 2; Length 356; 

Best Local Similarity 42.6%; Pred. No. 1.2e-35; 

Matches 168; Conservative 55; Mismatches 86; Indels 85; Gaps 11; 

Qy 803 EVSTLSKKEKI PLQMEELSTAVYSNDDLFI SKEAQI RETETFSDSSPI EI I DEFPTLI SS 862 

= I I == I I : ■ I I hh II II I- h : : 

Db 30 QYSILREEREAELDSELI IESCDASSASEESPKRE QDSPPMK PSALDA 77 

Qy 863 KTDSFSKLAREYTDLEVSHKSEIANAPDGAGSL PCTELPHDLSLKNIQPKVEEKISF 919 

IN : Ml || || | 
Db 78 IREETGVRAEERAPSRRGLAEPGSFLDYPSTE PQPGPE 115 

Qy 920 SDDFSKNGSATSKVLLLPPDVSALGHTQAEIESIVKPKVLEKEAEKKLPSDTEKEDRSPS 97 9 

III II I h : | : | : -|h 

Db 116 LPPGDGAL EPETPMLP RKPEEDSSSNQSPA 145 

Qy 980 A 1 FSADLGKTS WDLLYWRD I KKTG WFGASLFLLLSLTVFS I VS 1024 

I : I M MIIIIIIMMIMIh I I I Ml Ihll 
Db 146 ATKGPGPLGPGAPPPLLF LNKQKAI DLLYWRDI KQTGI VFGSFLLLLFSLTQFSWS 202 

Qy 1025 VTAYI ALALLS VTI SFRI YKGVI QAI QKSDEGHPFRAYLESEVAI SEELVQKYSNSALGH 1084 

I IhMI II MINIM hlhlhlllllhllll M MM : I I I : : 

Db 2 03 WAYLALAALSATISFRIYKSVLQAVQKTDEGHPFKAYLELEITLSQEQIQKYTDCLQFY 262 

Qy 1085 VNCTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGLTLLILALISLFSVPVIYERH 1144 

I I IMIMIMM MIMMIIMIh ! . , : M 

Db 2 63 VNSTLKELRRLFLVQDLVDSLKFAVX.MWLLTYVGALFNGLTLLLMAWSMFTLPWYVKH 322 

Qy 1145 QAQIDHYLGLANKNVKDAMAKIQAKI PGLKRKAE 1178 

Mill MM - 
Db 323 QAQI DQYLGLVRTHI NAWAKI QAKI PGAKRHAE 3 56 



RESULT 4 
US-08-700-607-7 

; Sequence 7 , Application US/08700607 
; Patent No. 5858708 



; GENERAL INFORMATION: 

APPLICANT: Bandman, Olga 
APPLICANT: Au-Young, Janice 
APPLICANT: Goli, Surya K. 
APPLICANT: Hillman, Jennifer L. 

TITLE OF INVENTION: TWO NOVEL HUMAN NSP-LIKE PROTEINS 
NUMBER OF SEQUENCES : 9 
CORRESPONDENCE ADDRESS : 

ADDRESSEE: Incyte Pharmaceuticals, Inc. 

STREET: 3174 Porter Drive 

CITY: Palo Alto 

STATE : CA 

COUNTRY: U.S. 

ZIP: 94304 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Diskette 

COMPUTER: IBM Compatible 

OPERATING SYSTEM: DOS 

SOFTWARE: FastSEQ Version 1.5 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/08/7 00,607 

FILING DATE: Filed Herewith 
ATTORNEY /AGENT INFORMATION: 

NAME: Billings, Lucy J . 

REGISTRATION NUMBER: 36,749 

REFERENCE/DOCKET NUMBER: PF-0114 US 
TELECOMMUNICATION INFORMATION: 

TELEPHONE: 415-855-0555 

TELEFAX: 415-845-4166 
INFORMATION FOR SEQ ID NO: 7: 
SEQUENCE CHARACTERISTICS: 

LENGTH: 2 08 amino acids 

TYPE: amino acid 

STRANDEDNESS : s ingl e 

TOPOLOGY: linear 
MOLECULE TYPE: peptide 
IMMEDIATE SOURCE: 

LIBRARY: GenBank 

CLONE: 307311 
US-08-700-607-7 

Query Match 11.6%; Score 688; DB 2; Length 208; 

Best Local Similarity 67.5%; Pred. No. 5.9e-35; 

Matches 129; Conservative 32; Mismatches 30; Indels 0; Gaps 0; 
Qy 988 KTSVVDLLYWRDIKKTGVVFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVI 1047 



Db 




Qy 



104 8 QAI QKSDEGHPFRAYLESEVAI SEELVQKYSNSALGHVNCTI KELRRLFLVDDLVDSLKF 1107 



■ I I M I ' * I * I •Ml- - I I I : I I E I I I I I I II M M It 

78 QAVQKTDEGHPFKAYLELEITLSQEQIQKYTDCLQFYVNSTLKELRRLFLVQDLVDSLKF 13 





Db 



QY 




Db 



Qy 1168 AKI PGLKRKAE 1178 

Mill II II 

Db 198 AKI PGAKRHAE 2 08 



RESULT 5 
US-08-700-607-8 

Sequence 8, Application US/08700607 
Patent No. 5858708 
GENERAL INFORMATION: 

APPLICANT : Bandman , Olga 
APPLICANT: Au- Young, Janice 
APPLICANT: Goli, Surya K. 
APPLICANT: Hillman, Jennifer L. 

TITLE OF INVENTION: TWO NOVEL HUMAN NSP-LIKE PROTEINS 
NUMBER OF SEQUENCES : 9 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: Incyte Pharmaceuticals, Inc. 
STREET: 3174 Porter Drive 
CITY: Palo Alto 
STATE: CA 
COUNTRY: U.S. 
ZIP: 94304 
COMPUTER READABLE FORM: 
MEDIUM TYPE: Diskette 
COMPUTER: IBM Compatible 
OPERATING SYSTEM: DOS 
SOFTWARE: FastSEQ Version 1.5 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/08/700,607 
FILING DATE: Filed Herewith 
ATTORNEY/AGENT INFORMATION: 
NAME: Billings, Lucy J. 
REGISTRATION NUMBER: 36,749 
REFERENCE/DOCKET NUMBER: PF-0114 US 
TELECOMMUNICATION INFORMATION: 
TELEPHONE: 415-855-0555 
TELEFAX: 415-845-4166 
INFORMATION FOR SEQ ID NO: 8: 
SEQUENCE CHARACTERISTICS: 
LENGTH: 267 amino acids 
TYPE: amino acid 
STRANDEDNESS : s ingl e 
TOPOLOGY: linear 
MOLECULE TYPE: peptide 
IMMEDIATE SOURCE: 
LIBRARY: GenBank 
CLONE: 281046 
US-08-700-607-8 

Query Match 11.3%; Score 671; DB 2; Length 267; 

Best Local Similarity 66.3%; Pred. No. 9.4e-34; 

Matches 124; Conservative 34; Mismatches 29; Indels 0; Gaps 0; 

QY 988 KTSWDLLYWRDI KKTGWFGASLFLLLSLTVFS I VSVTAYI ALALLSVTI SFRI YKGVI 1047 

' . I II III Ihlll Ihlll II llllllll h 

Db 9 KSQAIDLLYWRDIKQTGIVFGSFLLLLFSLTQFSWSWAYLALAALSATISFRIYKSVL 68 



Qy 1048 QAIQKSDEGHPFRAYLESEVAI SEELVQKYSNSALGHVNCTI KELRRLFLVDDLVDSLKF 1107 

Ihlhlllllhllll h :hl :||h: : I I , 

Db 69 QAVQKTDEGHPFKAYLELEITLSQEQIQKYTDCLQLYVNSTLKELRRLFLVQDLVDSLKF 128 

Qy 1108 AVL^VFTYVGALFNGLTLLIIALISLFSVPVIYERHQAQIDHYLGLANKNVK1)AMAKIQ 1167 

MMh llllllllllll|::|::|:|::||:| :|||h! |||| :: , 
Db 12 9 AVLMWLLTYVGALFNGLTLLLMAWSMFTLPVVYVKHQAQVDQYLGLVRTHINTWAKIQ 18 8 

Qy 1168 AKIPGLK 1174 

Mill 

Db 189 AKIPGAR 195 



RESULT 6 

US-09-149-476-563 

; Sequence 563, Application US/09149476 

; Patent No. 6420526 

; GENERAL INFORMATION: 

; APPLICANT: Rosen et al. 

; TITLE OF INVENTION: 186 Human Secreted proteins 
; FILE REFERENCE: PZ002P1 

; CURRENT APPLICATION NUMBER: US/09/ 14 9 , 476 

; CURRENT FILING DATE: 1998-09-08 

; EARLIER APPLICATION NUMBER: PCT/US98/04493 

; EARLIER FILING DATE: 1998-03-06 

; EARLIER APPLICATION NUMBER: 60/040 , 162 

EARLIER FILING DATE: 1997-03-07 
; EARLIER APPLICATION NUMBER: 60/04 0,333 
; EARLIER FILING DATE: 1997-03-07 
; EARLIER APPLICATION NUMBER : 60/038,621 
; EARLIER FILING DATE: 1997-03-07 
; EARLIER APPLICATION NUMBER: 60/040,626 

EARLIER FILING DATE: 1997-03-07 
; EARLIER APPLICATION NUMBER: 60/040,334 
; EARLIER FILING DATE: 1997-03-07 
; EARLIER APPLICATION NUMBER: 60/04 0,33 6 

EARLIER FILING DATE: 1997-03-07 
; EARLIER APPLICATION NUMBER: 60/040,163 

EARLIER FILING DATE: 1997-03-07 
; EARLIER APPLICATION NUMBER: 60/047,600 
; EARLIER FILING DATE: 1997-05-23 
; EARLIER APPLICATION NUMBER: 60/047,615 

EARLIER FILING DATE: 1997-05-23 
; EARLIER APPLICATION NUMBER: 60/047,597 
; EARLIER FILING DATE: 1997-05-23 
; EARLIER APPLICATION NUMBER: 60/047,502 

EARLIER FILING DATE: 1997-05-23 
; EARLIER APPLICATION NUMBER: 60/047,633 
; EARLIER FILING DATE: 1997-05-23 
; EARLIER APPLICATION NUMBER: 60/047,583 

EARLIER FILING DATE: 1997-05-23 
; EARLIER APPLICATION NUMBER: 60/047,617 
; EARLIER FILING DATE: 1997-05-23 
; EARLIER APPLICATION NUMBER: 60/047,618 

EARLIER FILING DATE: 1997-05-23 
; EARLIER APPLICATION NUMBER: 60/047,503 



EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,592 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,581 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,584 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,500 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,587 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,492 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,5 98 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,613 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,582 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,596 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,612 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,632 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,601 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/043,580 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,568 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,314 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,569 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,311 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,671 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,674 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,669 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,312 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,313 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,672 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,315 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/048,974 
EARLIER FILING DATE: 1997-06-06 
EARLIER APPLICATION NUMBER: 60/056,886 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,877 
EARLIER FILING DATE: 1997-08-22 



EARLIER APPLICATION NUMBER : 60/056,889 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,893 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,630 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,878 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,662 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,872 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,882 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,637 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,903 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,888 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,879 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,880 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,894 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,911 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,636 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,874 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,910 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,864 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,631 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,845 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,892 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/057,761 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/047,595 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,599 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,588 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,585 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,586 
EARLIER FILING DATE : 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,590 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,594 



EARLIER FILING DATE : 1997-05-23 

EARLIER APPLICATION NUMBER: 60/047,589 

EARLIER FILING DATE: 1997-05-23 

EARLIER APPLICATION NUMBER: 60/047,593 

EARLIER FILING DATE: 1997-05-23 

EARLIER APPLICATION NUMBER: 60/047,614 
; EARLIER FILING DATE: 1997-05-23 
; EARLIER APPLICATION NUMBER: 60/043,578 

EARLIER FILING DATE: 1997-04-11 
; EARLIER APPLICATION NUMBER: 60/043,576 

EARLIER FILING DATE: 1997-04-11 
; EARLIER APPLICATION NUMBER: 60/047,501 

EARLIER FILING DATE: 1997-05-23 

EARLIER APPLICATION NUMBER: 60/043,670 

EARLIER FILING DATE: 1997-04-11 

EARLIER APPLICATION NUMBER: 60/056,632 

EARLIER FILING DATE: 1997-08-22 
; EARLIER APPLICATION NUMBER: 60/056,664 

EARLIER FILING DATE: 1997-08-22 

EARLIER APPLICATION NUMBER: 60/056,876 
; EARLIER FILING DATE: 1997-08-22 
; EARLIER APPLICATION NUMBER: 60/056,881 

EARLIER FILING DATE: 1997-08-22 
; EARLIER APPLICATION NUMBER: 60/056,909 

EARLIER FILING DATE: 1997-08-22 

EARLIER APPLICATION NUMBER: 60/056,875 

EARLIER FILING DATE: 1997-08-22 

EARLIER APPLICATION NUMBER: 60/056,862 

EARLIER FILING DATE: 1997-08-22 

EARLIER APPLICATION NUMBER : 60/056,887 

EARLIER FILING DATE: 1997-08-22 

EARLIER APPLICATION NUMBER: 60/056,908 

EARLIER FILING DATE: 1997-08-22 

EARLIER APPLICATION NUMBER: 60/048,964 
; EARLIER FILING DATE: 1997-06-06 
; EARLIER APPLICATION NUMBER: 60/057,650 
; EARLIER FILING DATE: 1997-09-05 
; EARLIER APPLICATION NUMBER: 60/056,884 

EARLIER FILING DATE: 1997-08-22 

EARLIER APPLICATION NUMBER: 60/057,669 

EARLIER FILING DATE: 1997-09-05 
; EARLIER APPLICATION NUMBER: 60/049,610 
; EARLIER FILING DATE: 1997-06-13 
; EARLIER APPLICATION NUMBER: 60/061,060 

EARLIER FILING DATE: 1997-10-02 

Query Match 9.1%; Score 541.5; DB 4; Length 168; 

Best Local Similarity 60.5%; Pred. No. 4.2e-26; 

Matches 101; Conservative 34; Mismatches 31; Indels 1; Gaps 1; 

Qy 1013 LLLSLTVFS I VS VTAYI ALALLSVTI SFRI YKGVI QAI QKSDEGHPFRAYLESEVAI SEE 1072 

: I I I I I!--;" :h IMIMMIMIII lllhllhllllhlll: :| | 

1 MLLSLAAFSVI S WS YLI LALLSVTI SFRI YKSVT QAVQKSEEGHPFKAYLDVDI TLSSE 60 

Qy 1073 LVQKYSNSALGHVNCTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGLTLLILALI 1132 

I hh hi :| : llllhlllllM || Ih llllhllhllllll : 
Db 61 AFHNYMNAAMVHINRALKLI I RLFLVEDLVDSLKLAVFMWLMTYVGAVFNGI TLL I LAEL 12 0 



Qy 1133 SLFS VPVI YERHQAQI DHYLGLANKNVKDAMAKI QAKI PGL - KRKAE 1178 

M I I I : : I I : : : llllhhl | : |||||:||: |:||| 

Db 121 L I FSVP I VYEKYKTQI DHYVGI ARDQTKS I VEKI QAKLPG I AKKKAE 167 



RESULT 7 
US-08-700-607-3 

Sequence 3, Application US/08700607 
Patent No. 5858708 
GENERAL INFORMATION: 

APPLICANT: Bandman, Olga 
APPLICANT: Au- Young, Janice 
APPLICANT: Goli, Surya K. 
APPLICANT: Hillman, Jennifer L. 

TITLE OF INVENTION: TWO NOVEL HUMAN NSP-LIKE PROTEINS 
NUMBER OF SEQUENCES : 9 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: Incyte Pharmaceuticals, Inc. 
STREET: 3174 Porter Drive 
CITY: Palo Alto 
STATE : CA 
COUNTRY : U.S. 
ZIP: 94304 
COMPUTER READABLE FORM: 
MEDIUM TYPE: Diskette 
COMPUTER: IBM Compatible 
OPERATING SYSTEM: DOS 
SOFTWARE: FastSEQ Version 1.5 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/ 08/700 , 6 07 
FILING DATE: Filed Herewith 
ATTORNEY/AGENT INFORMATION: 
NAME: Billings, Lucy J. 
REGISTRATION NUMBER: 36,74 9 
REFERENCE/DOCKET NUMBER : PF-0114 US 
TELECOMMUNICATION INFORMATION: 
TELEPHONE: 415-8 55-0555 
TELEFAX: 415-84 5-4166 
INFORMATION FOR SEQ ID NO: 3: 
SEQUENCE CHARACTERISTICS: 
LENGTH: 241 amino acids 
TYPE: amino acid 
STRANDEDNESS : s ingl e 
TOPOLOGY: linear 
MOLECULE TYPE: peptide 
IMMEDIATE SOURCE: 

LIBRARY: THP1NOB01 
CLONE: 31870 
US-08-700-607-3 



Query Match 8.7%; Score 513; DB 2; Length 241; 

Best Local Similarity 47.7%; Pred. No. 4e-24; 

Matches 102; Conservative 37; Mismatches 55; Indels 20; Gaps 1; 

QY 963 AEKKLPSDTEKEDRSPSAIFSADLGKTSWDLLYWRDIKKTGWFGASLFLLLSLTVFSI 1022 

Ml 1= : H lh llhllll ||| :| :|||| ||: 



Db 



20 AEPSAPGGGGSPGACPALGTKSCSSSCAVHDLIXWRDVKKTGFVFGTTLIMLLSLAAFSV 79 



Qy 1023 VSVTAYI ALALLS VTI SFRI YKGVI QAI QKSDEGHPFRAYLESEVAI SEELVQKYSNSAL 1082 

: I I : I : MIMIIIIIIIII lllhlihllllhlll: - U I I 
Db 8 0 I SWS YLI LALLS VTI SFRI YKS VI QAVQKSEEGHPFKAYLDVDI TLSSEAFHNYMNAAM 13 9 

Qy 1083 GHVNCTI KELRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGLTLLI LALI SLFSVPVI YE 1142 

IH : I = llllhlllllll II lh ' : : 

Db 140 VH I NRALKLI I RLFLVEDLVDSLKLAVFMWLMTYVGAVFNG I TLL I LAELL I XS VP I VYX 199 

Qy 1143 RHQAQI DHYLGLANKNVKDAMAKI QAKI PGLKRK 1176 

Db 2 00 KY KVPSKTPWNRQK 213 

RESULT 8 

US-08-714-741-32 

Sequence 32, Application US/ 08 7 14741 
Patent No. 6500613 
GENERAL INFORMATION: 

APPLICANT: Briles, David E. 
APPLICANT: McDaniel, Larry S. 
APPLICANT: Swiatlo, Edwin 
APPLICANT: Yother, Janet 
APPLICANT: Crain, Marilyn J . 
APPLICANT: Holl ingshead , Susan 
APPLICANT: Tart, Rebecca 
APPLICANT: Brooks -Walter , Alexis 

TITLE OF INVENTION: PNEUMOCOCCAL GENES , PORTIONS THEREOF, 

TITLE OF INVENTION: EXPRESSION PRODUCTS THEREFROM, AND USES OF SUCH GENES, 
TITLE OF INVENTION: PORTIONS AND PRODUCTS 
NUMBER OF SEQUENCES: 47 
CORRESPONDENCE ADDRESS : 

ADDRESSEE: Curtis, Morris & Safford, P.C. 
STREET: 53 0 Fifth Avenue 
CITY: New York 
STATE: New York 
COUNTRY : U.S. 
ZIP: 10036 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 
COMPUTER: IBM PC compatible 
OPERATING SYSTEM: PC-DOS/MS-DOS 
SOFTWARE: Patentln Release #1.0, Version #1.30 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/08/714,741 
FILING DATE: 16-SEP-1996 
CLASSIFICATION: 435 
ATTORNEY/ AGENT INFORMATION: 

NAME: Frommer Esq., William S. 
REGISTRATION NUMBER: 25,5 06 
REFERENCE/DOCKET NUMBER: 454312-2460 
TELECOMMUNICATION INFORMATION: 
TELEPHONE: (212) 840-3333 
TELEFAX: (212) 840-0712 
INFORMATION FOR SEQ ID NO: 32: 
SEQUENCE CHARACTERISTICS: 



LENGTH: 8 991 amino acids 
TYPE: amino acid 
STRANDEDNESS : s ingl e 
TOPOLOGY: linear 
MOLECULE TYPE: amino acid 
US-08-714-741-32 

Query Match 5.3%; Score 316; DB 4; Length 8 991; 

Best Local Similarity 21.4%; Pred. No. 8.4e-10; 

Matches 248; Conservative 138; Mismatches 479; Indels 296; Gaps 52; 

Qy 6 QS PLVSSSDS P - PRPQPAFKYQF - VREPEDEEEEEEEEEEDEDEDLEE - - LEVLERKPAA 61 

-I : I hi II I = :h h :::: III || : :::| 

Db 7193 EAPAEQPKPAPAPQPAPAPKPEKPAEQPKPEKTDDQQAEEDYARRSEEEYNRLTQQQPPK 72 52 

Qy 62 GLSAAPVP TAPAAG APLMDF 81 

Db 7253 AEKPAPAPKTGWKQENGMWYFYNTDGSMGEQAGQYRAA^ 7312 

Qy 82 GNDFVPPAPRGPLPAAPPVAPERQPSWDPSPVSSTVPAPSPLSAAAVSPSKLPED DE 138 

h III II III || hi MM I :: =| :| 

Db 7313 VNEPEKPAPAPETPA- -PEAPAEQPK- -PAPAPQPAPAPKPEKPAEQPKAEKTDDQQAEE 7368 

Qy 13 9 PPAR PPPPPPASVSPQAEPVWTPPAPAPAAPPSTPAAPKRRGSSGAWXXX 18 9 

M II : I I Mill I | : | : 

Db 736 9 DYARRSEEEYNRLTQQQPPKAEKPAPAPKPEQPAPAPKNSKGEQAEQYRSAAGGDLAAKQ 7428 

Qy 190 XKI MDLKEQPGNTISAGQEDFPSVLLET-AASXPSLSPLSAASFKEHEYLGNLSTV 244 

Mh : : = |= || | |= | | : : 
Db 742 9 VELEKTEADLKK AVNE P E KPAPAP ET PAP EAPAEQ PKPA PA PQ 7471 

Qy 245 LPTEGTLQENVSEASKEVSEKAKTLLIDRDLTEFSELEYSEMGSSFSVSPKAESAVIVAN 304 

I I : I I : I I = I MM-: : MM 

Db 74 72 -PAPAPKPEKPAEQPK- -AEKPADQQAEEDYDRRSEEEYNRL- -TQQQPPKAEKPAPAPQ 7526 

Qy 3 05 PREEI IVKNKDEEEKLVSNNILHXQQELPTALTKLVKEDEWSSEKAKDSFNE 357 

I : : I h : III || I =|= I : I 

Db 7527 PEQPAPAPKSLKEIDESDSEDYVKEGFRAPLQSELDAKQAKLSKLEEL SDKIDE 758 0 

Qy 358 KRVAVEAPMREEYADFKPFERVWEVKDSKEDSDMLAAGGKIESNLESKVDKKCFADSLEQ 417 

= I : = = III || || : : | : || ||| 
Db 7581 LDAEI -AKLEKDVEDFK XSDGEQAGQYLAAAEEDLI AKKA ELEQ 7623 

Qy 418 TNHEKDSESSNDDTSFPS TPEGI KDRSGAYITCAPFNPAATESIATNIFPLLEDPTS 474 

I 11 = = == - I = II Mill I 

Db 7624 T- -EADLKKAVNEPGKPAPAPAPE TPAPEAPAEQPK PAP 7660 

Qy 475 ENXTDEKKIEEKKAQIVTEKNTSTKTSNPFFVAAQDSETDYV--TTDNLTKVTEEWA-- 530 

I I h I II I I =| II : : ::| = : | 

Db 7 661 ETPAPAPKPEKPAEQPKPEK PADQQAE E D YARR S E E E YNRLTQQQ PAPA 7709 

Qy 531 NMPEGLTPDLVQEACESELNEVTGTKIAYETKMDLVQTSEVMQESLYPAAQLCP 584 

II : | :M: - : | | | || = : = | 

Db 7710 QKPEQPAKPEKPAEEPTQPEKDAEIAKLE-KNVEYFKKTDAEQTEQYLAAAEKDLADKKA 7768 

Qy 585 SFEESEA TPSPV-LPDIVMEAPLNSAVPSAGASVIQPSSSPLEAS 628 

l = = M 11 = 1 h I I : I I :| : : : 



Db 776 9 ELEKTEADLKKAVNEPEKPAEETPAPAPKPEQPAEQPKPAPAPQP-APAPKPEKTDDQQA 7827 

Qy 629 S VNYES I KHEPENPPPYEEAMSVSLKVSGI KEE 1 KEPENINAALQETEAPYI S IAC 684 

: I III- II : III || : I 

Db 7828 EEDYARRSEEEYNRLPQQQPPKAEKPAPAPKPEQPVPAEXPEN PAPAPKPAXAP 7881 

Qy 68 5 DLI KETKLSAEPAPDFSDYSEMAKVEQPV PDHSELVEDSS PDS 727 

= 1 : : =1 h I hi II = II : I 

Db 7882 QPLKPEEPAEQPKPE KPEEPAGQPEPEKPDDQQAGEDYARRSGGEYNRFPQQ 7933 

Qy 728 E P VDLFSDDS I PDVPQKQDETVMLVKESLTETS FESMI E YENKEKLSALP PEGG 781 

H | | :: | | | | ||: 

Db 7934 QPPKAEKPAPAP KPEQPVPAPKTLL KKAKLAGAKSKAATKKAEL 7977 

Qy 782 KPYLESFKLSLDNTKDTLLP DEV STLSKK-EKI PLQMEELSTAVYSNDD 829 

: I II = hi II I Ih : hll I :| h II : : I 

Db 7978 EPELEKAEAELENLLSTLDPEGKTQDELDKEAAEAELNKKVEALPNQVSELEEELSKLED 8037 

Qy 830 LFISKEAQIRETETFSDSSPIEI IDEFPTLISSKTDSFSKLARE YTDLEVSHKSEI 885 

hh I : I : l-l I ■ I : | . I 

Db 8038 NL - - KDAETNNVEDY IKEGLEEAIATKQAELEKTPKELDAALNELGPDGDEEE 8088 

Qy 886 ANAPDGAGSLPCTELPHDLSLKNIQPKVEEKI - -SFSDDFSKNGSATSKVLLLPPDVSAL 943 

Db 8089 TPPPEAPAEQPKPEKPAE-ETPAPAPKPEKSADQQAEEDYARRSEEEYNRL 8138 

Qy 944 GHTQAEIESIVKPKVLEKEAEKKLPSDTEKEDRSPSAIFSADLGKTSWDLLYWRD 1 1000 

M : II Ml h I - I | | =:| | 

Db 8139 --TQQQ PPK AEKPAPAPAPKPEQPAPAPKSRG1ATKKKLNLAEARIELLL 8186 

Qy 1001 KKTGV VFGASLFLLLS 1016 

II I: II I III 

Db 818 7 KKLGLEPGLEKAGAGLGNLLS 82 07 



RESULT 9 

US-09-149-476-411 

; Sequence 411, Application US/09149476 

; Patent No. 6420526 

; GENERAL INFORMATION: 

APPLICANT: Rosen et al . 

; TITLE OF INVENTION: 186 Human Secreted proteins 
FILE REFERENCE: PZ002P1 

; CURRENT APPLICATION NUMBER: US/09/149,476 
CURRENT FILING DATE: 1998-09-08 
EARLIER APPLICATION NUMBER: PCT/US98 / 044 93 

; EARLIER FILING DATE: 1998-03-06 

; EARLIER APPLICATION NUMBER: 60/040,162 
EARLIER FILING DATE: 1997-03-07 
EARLIER APPLICATION NUMBER: 60/040,333 
EARLIER FILING DATE: 1997-03-07 

; EARLIER APPLICATION NUMBER: 60/038,621 
EARLIER FILING DATE: 1997-03-07 
EARLIER APPLICATION NUMBER: 60/040,626 
EARLIER FILING DATE: 1997-03-07 
EARLIER APPLICATION NUMBER: 60/040,334 
EARLIER FILING DATE: 1997-03-07 



EARLIER APPLICATION NUMBER: 60/04 0,336 
EARLIER FILING DATE: 1997-03-07 
EARLIER APPLICATION NUMBER: 60/040,163 
EARLIER FILING DATE: 1997-03-07 
EARLIER APPLICATION NUMBER: 60/047,600 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/04 7,615 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,597 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,502 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/04 7,633 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,583 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,617 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,618 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,503 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,592 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER : 60/047,581 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,584 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,500 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,587 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,492 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,598 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,613 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,582 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,596 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,612 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,632 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,601 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/043,580 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,568 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,314 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,569 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,311 



EARLIER 
EARLIER 
EARLIER 
EARLIER 
EARLIER 
EARLIER 
EARLIER 
EARLIER 
EARLIER 
EARLIER 
EARLIER 
EARLIER 
EARLIER 
EARLIER 
EARLIER 
EARLIER 
EARLIER 
EARLIER 
EARLIER 
EARLIER 
EARLIER 
EARLIER 
EARLIER 
EARLIER 
EARLIER 
EARLIER 
EARLIER 
EARLIER 
EARLIER 
EARLIER 
EARLIER 
EARLIER 
EARLIER 
EARLIER 
EARLIER 
EARLIER 
EARLIER 
EARLIER 
EARLIER 
EARLIER 
EARLIER 
EARLIER 
EARLIER 
EARLIER 
EARLIER 
EARLIER 
EARLIER 
EARLIER 
EARLIER 
EARLIER 
EARLIER 
EARLIER 
EARLIER 
EARLIER 
EARLIER 
EARLIER 
EARLIER 



FILING DATE 
APPLICATION 
FILING DATE 
APPLICATION 
FILING DATE 
APPLICATION 
FILING DATE 
APPLICATION 
FILING DATE 
APPLICATION 
FILING DATE 
APPLICATION 
FILING DATE 
APPLICATION 
FILING DATE 
APPLICATION 
FILING DATE 
APPLICATION 
FILING DATE 
APPLICATION 
FILING DATE 
APPLICATION 
FILING DATE 
APPLICATION 
FILING DATE 
APPLICATION 
FILING DATE 
APPLICATION 
FILING DATE 
APPLICATION 
FILING DATE 
APPLICATION 
FILING DATE 
APPLICATION 
FILING DATE 
APPLICATION 
FILING DATE 
APPLICATION 
FILING DATE 
APPLICATION 
FILING DATE 
APPLICATION 
FILING DATE 
APPLICATION 
FILING DATE 
APPLICATION 
FILING DATE 
APPLICATION 
FILING DATE 
APPLICATION 
FILING DATE 
APPLICATION 
FILING DATE 
APPLICATION 
FILING DATE 
APPLICATION 
FILING DATE 



: 1997-04-11 

NUMBER: 60/043,671 
: 1997-04-11 

NUMBER: 60/043,674 
: 1997-04-11 

NUMBER : 60/043,669 
: 1997-04-11 

NUMBER: 60/043,312 
: 1997-04-11 

NUMBER: 60/043,313 
: 1997-04-11 

NUMBER: 60/043,672 
: 1997-04-11 

NUMBER: 60/043,315 
: 1997-04-11 

NUMBER: 60/048,974 
: 1997-06-06 

NUMBER: 60/056,886 
: 1997-08-22 

NUMBER: 60/056,877 
: 1997-08-22 

NUMBER: 60/056,889 
: 1997-08-22 

NUMBER: 60/056,893 
: 1997-08-22 

NUMBER: 60/056,630 
: 1997-08-22 

NUMBER: 60/056,878 
: 1997-08-22 

NUMBER: 60/056,662 
: 1997-08-22 

NUMBER: 60/056,872 
: 1997-08-22 

NUMBER: 60/056,882 
: 1997-08-22 

NUMBER: 60/056,637 
: 1997-08-22 

NUMBER: 60/056,903 
: 1997-08-22 

NUMBER: 60/056,888 
: 1997-08-22 

NUMBER: 60/056,879 
: 1997-08-22 

NUMBER: 60/056,880 
: 1997-08-22 

NUMBER: 60/056,894 
: 1997-08-22 

NUMBER: 60/056,911 
: 1997-08-22 

NUMBER: 60/056,636 
: 1997-08-22 

NUMBER: 60/056,874 
: 1997-08-22 

NUMBER: 60/056,910 
: 1997-08-22 

NUMBER: 60/056,864 
: 1997-08-22 



EARLIER APPLICATION NUMBER: 60/056,631 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,845 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,892 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/057,761 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/047,595 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,599 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,588 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,585 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,586 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,590 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,594 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,589 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,593 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,614 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/043,578 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,576 
EARLIER FILING DATE : 1997-04-11 
EARLIER APPLICATION NUMBER: 60/047,501 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/043,670 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/056,632 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,664 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,876 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,881 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,909 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,875 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,862 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,887 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,908 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/048,964 
EARLIER FILING DATE: 1997-06-06 
EARLIER APPLICATION NUMBER: 60/057,650 



EARLIER FILING DATE: 1997-09-05 

EARLIER APPLICATION NUMBER: 60/056,884 

EARLIER FILING DATE: 1997-08-22 
; EARLIER APPLICATION NUMBER: 60/057,669 

EARLIER FILING DATE: 1997-09-05 
; EARLIER APPLICATION NUMBER: 60/049,610 

EARLIER FILING DATE: 1997-06-13 
; EARLIER APPLICATION NUMBER: 60/061,060 

EARLIER FILING DATE: 1997-10-02 

Query Match 4.8%; Score 285; DB 4; Length 92; 

Best Local Similarity 57.1%; Pred. No. 9.8e-ll; 

Matches 52; Conservative 20; Mismatches 19; Indels 0; Gaps 0; 

Qy 1079 NSALGHVNCTI KELRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGLTLLILALI SLFSVP 1138 

M: hi :| : ill! || ||: | | | | | : | | | : | | | | | | : :|||| 

Db 2 NAAMVH I NRALKL 1 1 RLFLVEDLA^DSLKLA VFMWLMTY VGAVFNGI TLLI LAELL I FS VP 61 

Qy 113 9 VI YERHQAQ I DH YLGLANKNVKDAMAKI QAK 1169 

-I!-: Illlhhl | : || :| 

Db 62 I VYEKYKTQI DHYVGI ARDQTKS I VEKI PSK 92 



RESULT 10 
US-08-973-462-8 

; Sequence 8, Application US/08973462B 

; Patent No. 6191270 

; GENERAL INFORMATION: 

; APPLICANT: DRUILHE, PIERRE 

; APPLICANT: DAUBERS I ES , PIERRE 

; TITLE OF INVENTION: MALARIAL PRE -ERYTHROCYTIC STAGE POLYPEPTIDE MOLECULES 

FILE REFERENCE: 0660-0125-0 PCT 
; CURRENT APPLICATION NUMBER: US/08 / 973 , 4 62B 
; CURRENT FILING DATE: 1998-02-06 

EARLIER APPLICATION NUMBER: PCT/FR96/00894 
; EARLIER FILING DATE: 1996-06-12 

EARLIER APPLICATION NUMBER: FR 95/07007 
; EARLIER FILING DATE: 1995-06-13 

NUMBER OF SEQ ID NOS : 2 9 

SOFTWARE: Patentln Ver. 2.0 
; SEQ ID NO 8 

LENGTH: 17 8 6 
TYPE : PRT 

ORGANISM: Artificial Sequence 
FEATURE : 

OTHER INFORMATION: Description of Artificial Sequence : Polypeptide 
US-08-973-462-8 

Query Match 4.7%; Score 276.5; DB 3; Length 1786; 

Best Local Similarity 20.1%; Pred. No. 2.2e-08; 

Matches 232; Conservative 201; Mismatches 471; Indels 249; Gaps 50; 

QY 3 0 EPEDEEEEEEEEEEDEDEDLEE LEVLERKPAAGLSAAPVPTAPAAGAPLMDFGN 83 

I M : I I lh : I : : I I :| |: : : : || :: 
D b 228 EESVEENDEESVEENVEENVEENDDGSVASSVEESIASSVDESIDSSIEENVAPTVE 284 



QY 84 DFVPPAPRGPLPAAPPVAPERQPSWDPSPVSSTVPAPSPLSAAAVSPSKLPEDDEPPAR- 142 



■ I h III : I = : I III :| | 

Db 28 5 EIVAPS WESVAPSVEESVEENVEESVAENVEESVAENVEESVAENVEESVAEN 338 

Qy 143 PPPPPPASVSPQAEPWTPPAPAPAAPPSTPAAPKRRGSSGAWXXXXKIMDLKEQ 198 

I hi I : I II : = II :::| 

Db 339 VEEIVAPTVEEI VAPTVEEI VAPSVVESVAPSVEESVEENVEESVAENVEESVAENVEES 398 

Qy 199 PGNTI SAGQEDFPSVLLETAAS XPSLSPLSAASFKEHEYLGNLSTVLPT- EGTL 2 51 

: :| : |: | |:: : | : :| : :| |: | :: 

Db 399 VAENV EES VAENVEESVAENVE EI VAPTVEEI VAPTVEEI VAPSWESVAPSVEESV 455 

Qy 252 QENVSEASKEVSEKAKTLLIDRDLTEFSELEYSEMGSSFSVSPKAESAVI VANP 305 

: I I I h I ■■■■ ■ \ I :| lh I :| || 

Db 456 EENVEESVAENVEESVAENVEESVAENVEESVAE-NVEESVAENVEESVAENVEESVAEN 514 

Qy 306 REEI IVKNKDE EEKLVSNNI LHX QQELPTALTKLVKED EW 346 

Db 515 VEEIVAPTVEEI VAPTVEEI VAPSWESVAPSVEESVEENVEESVAENVEESVAENVEES 574 

Qy 347 SSEKAKDSFNEKRVAVEAPMREEYA DFKPFERVWEVKDSKEDSDMLAAGGKIESN 4 01 

= 1-11 : II II = I I I I - I : I I 

Db 575 VAENVEESVAENVEEIVAPTVEEIVAPTVEEIVAPSWESVAPSVEES VEEN 626 

Qy 4 02 LESKVDKKCFADSLEQTNHE- -KDSESSNDDTSFPSTPEGIKDRSGAYITCAPFNPAATE 459 

Db 62 7 VEESV AENVEESVAENVEESVAENVEE I VAPTVEEI VAPTVEEI VA PSWE 677 

Qy 460 S I ATNI F PLLEDPTS ENXTDE - - KKI EEKKAQI VTEKNTSTKTSNPFFVAAQDS ETD YVT 517 

Db 678 SVA PSVEESVEENVEESVAENVEESVAENVEESVAENVEES VA EN VE E I VA P T 730 

Qy 518 TDNLTKW-EEWA-NMPEGLTPDLVQEACESELNEVTGTKIAYETKMDLVQTS--EVMQ 573 

• • I Ihll - I = I hh I = I : |: : |: | | :: 

Db 731 VEEIVAPTVEEI VAPSWESVAPS -VEESVEENVEESVAENVE-ESVAENVEESVAENVE 788 

Qy 574 ESLYPAAQ- -LCPSFEESEATPSPVLPDI VMEAPLNSAVPSAGASVIQPSSSPLEASSVN 631 

Db 789 ESVAPTVEEIVAPSVEESVA-PS VEESVAENVATNLSDNLLSNLLGGI ETEEI K 841 

Qy 632 YESIKHEPENPPP YEEAMSVSLKV- SGI KEEI KE PENINAALQETEA 677 

= 11 =1 I II : 1= I I llhl : I hi 

Db 8 4 2 - DS I LNE I EE VKENWTTI LENVEETTAES VTTFSN I LEE I QENTI TNDTI EEKLEELHE 900 

Qy 678 PYISIA CDLIKETKLSAEPAPDFSDYSEMA- -KVEQPVPDHSELVED 722 

= 1 I hhl IN =111 : : I =| = 

Db 901 NVLSAALENTQSEEEKKEVIDVIEEVK- -EEVATTLIETVEQAEEKSANTITEIFENLEE 958 

Qy 723 SSPDS EPVDLFSDDSI PDVPQKQDETVMLVKESLTET 759 

Db 959 NAVESNENVAENLEPCLNETVFNTVLDKVEETVEI SGESLENNEMDKAFFSEI FDNVKGI Q 1018 

Qy 760 SFESMIEYENKEKLSALPPEGGKPYLESFKLSLDNTKDTLLPDEVSTLSKK 810 

I h I -:|h : : I h II :| 

Db 1019 ENLLTGMFRS I ETS I VI QSEEKVDL NENWSSILDNIENMKEGLL-NKLENI SST 1072 

Qy 811 E KIPLQMEE LSTAVYSNDDLF 831 



Db 1073 EGVQETVTEHVEQNVYVDVDVPAMKDQFLGILNEAGGLKEMFFNLEDVFKSESDVITVEE 1132 

Qy 832 ISKEAQIRETETFSDSSPIEI IDEFPTLISSKTDSFSK1AREYTDLEVSHKSEIA 886 

= II = =11 = hi II I - I I 

Db 1133 I KDEPVQKEVE - KETVS 1 1 EEMEENI VDVLEEEKEDLTDKM I DAVEES I E I SSDSKEETE 1191 

QY 887 NAPDGAG- -SLPCTELPHD LSLKNIQPK-VEEKISFSDDFSKNGSATSKVL 934 

= 1 II h : I |||:: : : = : : :| |||:: 
Db 1192 SIKDKEKDVSLWEEVQDNDMDESVEKVLELKNMEEELMKDAVEIND ITSKLI 1244 

Qy 935 LLPPDVSALGHTQAEIESIVKPKVLEKEAEKKLPSDTE KEDRS PSA I FSADLGKT 98 9 

= I : I : = = l II II I I- hi | | 

Db 1245 EETQELNEVEAD LI KDMEKLKELEKALSEDSKEI IDAKDDTLEKVIEEEHDITT 1298 

Qy 990 SWDLLYWRDIKK 1002 

Db 12 99 TLDEWELKDVEE 1311 



RESULT 11 
US-09-345-473E-40 

; Sequence 40, Application US/09345473E 

; Patent No. 6558903 

; GENERAL INFORMATION: 

; APPLICANT: Hodge , Martin 

; TITLE OF INVENTION: No. 6558903el Kinases and Uses Thereof 
FILE REFERENCE: 35800/183781 

CURRENT APPLICATION NUMBER: US/09/345,473E 
CURRENT FILING DATE: 1999-06-30 
; NUMBER OF SEQ ID NOS : 62 

; SOFTWARE: FastSEQ for Windows Version 4.0 
; SEQ ID NO 40 

LENGTH: 1601 

TYPE : PRT 

ORGANISM: C. elegans 
US-09-345-473E-40 

Query Match 4.5%; Score 267.5; DB 4; Length 1601; 

Best Local Similarity 20.5%; Pred. No. 6.8e-08; 

Matches 229; Conservative 134; Mismatches 376; Indels 377; Gaps 56; 

Qy 2 9 REPEDEEEEEEEEE EDEDEDLEELEVLERKPAAGLSAAPVPTAPAAGA 76 

I H Ml | | | ||: | ||:| || :| | || 
Db 462 RAKEEEERIREEAEIKEELRLRAEAKEKEKERLEK-ERLEKKAAAAAAANPNPTP 515 

Qy 77 PLMDFGNDFVPPAPRGP LPAAPPVAPE RQPSWDPS - PVS - - STVPAPSP 122 

= 1111 III-: = I = I I I I = = = I | | | 

Db 516 IPPTPATPHSSAQQQPIPPPLSTQTSAEIQQSAQQPSVPVTMIANIPAMSP 566 

Qy 123 LSA AAVSPSK LPEDDEPPARPPPPPPASVSPQAEPVW TPPAPAP- -AAPP 170 

M : I I : :| :| | :|: | || ||:||| 

°b 567 TSAQPQPVLSPTSAAVPVPTTMIHVPKPSEI PVQNVATTAAPVAANNVPPSPAPFKTEDI 626 

Qy 171 STPAAPKRRGSSGAWXXXXKIMDLKEQPGNTISAG-QEDFPSVLLETAASXPSLSPLSA 22 9 

II ■ Ih I I I I III : 

Db 627 QTPTLAQ NTVPRTISTDASGLVINTPASIASPSPAPS 663 



Qy 


230 


ASFKEHEYLGNLSTVLP TEG TLQENVSEASKEVSEKAKTLLI DRDLT 

1 • : : 1 1 hi 1 1 1 1 1 1 " • 1 " 


276 


Db 


664 


i ii II l l 1 1 1 | ■ ■ 1 

AT DVASTTAPVTPAPTPTTTTDGGAAAASTTTENKEEKRKSNKRKWMEILGCDES 


719 


Qy 


277 


E FSELEYSEMGSSFSVSP KAESAVI VANPREEI I VK 


312 


Db 


720 


RNFALVSCRLDTSHKSVTFQFAPGTDKPCTIATKLLAEDCLLKVHVHIVEAQLGEVIQLI 


779 


Qy 


313 


NKDEEE KLVSNNILHXQQELPTALTKLVKEDEWSSEKAKDSFNEKRVAVEAPMRE 


368 


Db 


780 


NS DGKKG VGTKLAT - - VLD PNSTEPPTI TAVM PKD S SAATASNTKP KIEI 


827 


Qy 


369 


EYADFKPFERVWEVKDSKEDSDMLAAGGKIESNLESKVDKKCFADSLEQTNHEKDSESSN 428 
1 . . 1 . . . .... I I I I i I 


Db 


828 


i • . | . . . . ... || | | . || 

EKTPPTRDASQE PNNVQVTNVRKVSQESN 


856 


Qy 


429 


DDTSFPSTPEGI KDRSGAYI TCAPFN - - PAAT- ES 


460 


Db 


857 


: =11 1 II 1 

AE-SVQSIP R PGG 1 1 VMS PTNQTDS A P P PTGAAAKPS RFQ VTKSAD P I ATP I S S S 


910 


Qy 


461 


I AT -NI FPLLEDPTSENXTDEKKI EEK- KAQI VTEKNTSTKTSNPFFVAAQDSETDYVTT 
hi : h = llllh lh:| 1 = h : :| | 


518 


Db 


911 


ISTATVIPIVA-ATPTNITSEPVIVQPITAQVITHLATPSPVSHSL SSNSSPSATTH 


966 


Qy 


519 


DNLTKVTEEWANMP-EGLTPDLVQEACESELNEVTGTKIAYETKMDLVQTSEVMQESLY 
1- : : -1 I I =| || := | | 


577 


Db 


967 


SNMSS I - - QSTTSVPGRRFTVQPVSQA- ESGI SSS I STPHPEPT 


1007 


Qy 


578 


PAAQLCPSFEESEATPSPVLPDIVMEAPLNSAV PSAGASVIQPSSS PLEAS SVNY 


632 


Db 


1008 


N II II H 1 II 1 III : 1 Ih 

PAITSCP PPVPSVPPWSNGTLNLEVAPKQTPSATNQNVDTQHSSSTASTATL 


1060 


Qy 


633 


ESIKHEPEN PPPYEEAMSVS LKVSG I KE E I KE P EN I NAA 


671 


Db 


1061 


: 1 1 1 :| : : lllh 1 : : 

--VSETPATVHVTPISVPAPVQEPLVIDHHSDVLTQLDSELRKVSGVSHS-ASPSTWES 


1117 


Qy 


672 


LQETEAPYISIACDLIKETKLSAEPAPDFSDYSEMAKVEQPVPDHSELVEDSSPDSEPVD 


731 


Db 


1118 


LTSMTPQTI PLACQTV- PAS IGQAPAVI AAAHA AS 


1151 


Qy 


732 


LFSDDS I PDVPQKQDETVMLVKESLTETSFESMI E YENKEKLSALPPEGGKPYLESFKLS 


791 


Db 


1152 


1 = 1 = 1 1 = 1 II ' h 

LIPNASVPQSPSRLD AETGLAGL-- HEKLEAL KME 


1184 


Qy 


792 


LDNTKDTLLPDE- - -VSTLSKKEKI PLQ MEELSTAVYSNDDLFISKEAQIRET- - 


841 


Db 


1185 


1 =1 = h U h:|h II :-: ||| 
QDRRED- - MGDDA I GTTTTDGKDE I P I DTLKGLAEALGKVI HADG RETTP 


1232 


Qy 


842 


ETFSDSSPIEII DEFPTLISSKTDSFSKLAREYTDLEVS- -HKSEIANAP 


889 


Db 


1233 


: 1 : 1 = = l | |: |: | | III | : : | 
MPPDHPDLTDASTQQLISPSNPDVLTTMSSAVEGSASSTMIEDIDASTSAVDASMMNSMP 


1292 


Qy 


890 


DGAGSLPCTELPHDLSLKNIQPKVEEKISFSDDFSKNGSATSKVLLLPPDVSALGHTQAE 
M :| :: :: | | | || : : | | 


949 


Db 


1293 


PGA QNSTDQI PAAMTLSMDQECAQSMTSSITR NTTGTKLAT 


1333 


Qy 


950 


IESIVKPKVLEKEAEKKLPSDTEKEDRSPSAIFSAD 98 5 





h: III I hi I I 

Db 1334 FENL ETALSSTLGTHI RQPNAPSSRD 1359 



RESULT 12 
US-08-978-277A-4 

; Sequence 4, Application US/08978277A 

; Patent No. 6582956 

; GENERAL INFORMATION: 

APPLICANT: Gelman, Irwin H. 

TITLE OF INVENTION: TUMOR SUPPRESSOR GENE 
NUMBER OF SEQUENCES: 2 0 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: Brumbaugh, Graves, Donohue & Raymond 

STREET: 30 Rockefeller Plaza 

CITY: New York 

STATE : NY 

COUNTRY : USA 

ZIP: 10112-0228 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Diskette 

COMPUTER: IBM Compatible 

OPERATING SYSTEM: DOS 

SOFTWARE: FastSEQ Version 1.5 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/08/ 978 , 277A 

FILING DATE: 

CLASSIFICATION: 514 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: 08/665,401 

FILING DATE: 18-JUN-1996 
ATTORNEY /AGENT INFORMATION: 

NAME: Clark, Richard S 

REGISTRATION NUMBER: 26,154 

REFERENCE/DOCKET NUMBER: A30558 - 165/34008 
TELECOMMUNICATION INFORMATION: 
TELEPHONE: 212-408-2558 
TELEFAX: 212-765-2519 
TELEX : 

INFORMATION FOR SEQ ID NO: 4: 
SEQUENCE CHARACTERISTICS: 

LENGTH: 1596 amino acids 

TYPE: amino acid 

STRANDEDNESS : single 

TOPOLOGY: linear 
MOLECULE TYPE: peptide 
HYPOTHETICAL: NO 
ANTI -SENSE: NO 
FRAGMENT TYPE: internal 
ORIGINAL SOURCE: 
US-08-978-277A-4 

Query Match 4.5%; Score 265; DB 4; Length 1596; 

Best Local Similarity 21.3%; Pred. No. 9.6e-08; 

Matches 232; Conservative 145; Mismatches 422; Indels 288; Gaps 47; 
QY 2 EDLD- -QSPLVSSSDSPPRPQPAFKYQFVREPEDE EEEEEEEEEDEDEDLEELEV 54 



H: H I I I I I I :|| ||:: || : : | 

62 0 EELEKVKSATLSSTDST VSEMQDEVKTVGEEQKPEEPKRRVDTSVSWEA 668 

55 LERKPAAGLSAAPVPTAPAAGAPL MDFGNDFVPPAPR 91 

669 LICVGSSKKRARKASSSDDEGGPRTLGGDSHRAEEAS 728 

92 -GPLPAAPPVAPERQPSWDPSPVSSTVPAPSPLSAAAVSPSKLPEDDEPPARPPPPPPAS 150 

I M I I =h I H MINI: 
72 9 SSPEPAGSPSEGEGVSTWE SFKRLVTPRKK SKSKLEEKAEDSS- -VEQLSTE 778 

151 VSPQAEPVWTPPAPAPAAPPSTPAAPKRRGSSGAWXXXXKIMDLKEQPGNTISAG 206 

= 111 I hi I h: :| 

779 IEPSREESWV S I KKF I PGRRKKRA DGKQEQATVEDSGPVEI 819 

207 QEDFPSVXiLETAASXPSLSPLSAASFKEHEYL GN LSTVLPTEGTLQENVSE 257 

II hi h: III : I I : || I | 
820 NEDDPNV PAWPLSEYNAVEREKMEAQGNTELPQLLGAVYVSE E 863 

258 ASKEVSEKAKTLLID- -RDLTEFSELEYSEMGSSFSVSPKAESAVI VANPREEI I VKNKD 315 

M : : I I I : I I I : :| : | :| | |h h 

864 LSKTLVHTVSVAVIDGTRAVTSVEERSPSWISASVT-EPLEHTAGEAMPPVEEVTEKDII 922 

316 EEEKLVSNNILHXQQELPTALTKLVKE DEWS S EKAKDS FNE KR VAVEAPMREE Y 370 

I I I lh : I h:|:|| I || | 

923 AEETPV LTQTLP EGKDAHDDM VTS E VDFTS EAVTATET 960 

371 ADFKPFERVWEVKDSKEDSDMLAAGGKIESNLESKVDKKCFADSLEQTNHEKDSESSNDD 43 0 

- I I I = = l I I — I == = h h - II I 

961 S EALRTEE VTEASGAEETTDMVSAVSQLTDS PDTTEEATPVQEVESGVLD 1010 

431 TS FPSTPEGIKDRSGAYITCAPFNPAATESIATNIFPLLEDPTSENXTDEK 481 

I = : :h I lh- II 
1011 TEEEERQTQAILQAVADKVKEES QVPATQTVQRTGS KALE KVE 1053 

482 KIEEKKAQI VTEKNTSTKTSNPFFVA AQDSETDYVTTDNLTKVTEEWANMPEGL 536 

-II : :|| | | || Ml | || | 

1054 E VEEDS E VLAS E KE KDVMP KG P VQEAGAEHLAQGS ETGQAT PESL 1098 

537 TPDLVQEACESELNEVTGTKIAYETKMDLVQTSEVMQESLYPAAQLCPSFEESEATPSPV 596 
h III : : ::|:::: I : : :|| | 

1099 EVPEVT-ADVDHVATCQVIKLQQLMEQAVAPESS--ETLTDSETNGSTP 1144 

597 LPD 1 VMEAPLNSAVPSAGASVI QPSSS PLEAS SVNYES I KHEPENPPPYEEAMS 650 

I I : -I I hi I ■ lh-- I I I I I 

1145 LADSDTADGTQQDETIDSQDSKATAAVRQSQVTEEEAATAQKEEPSTLPNNVPAQE 1200 

651 VSLKVSGIKEEIKEPE^JINAALQETEAPYISIACDLIKETKLSAEPAPDFSDYSE^4AKVE 710 
I Ml h |= : | - :|:: | |:| | || 

1201 EHGEEPG- -RDVLEPTQQELTAAAVPVLAKTEVGQEGEVDWLD-GEKVKEE 1248 

711 QPVPDHSELVEDSS PDSE - PVDLFSDDS I PDVPQKQD - ETVMLVKESLTETS FESM I EYE 768 
I I I I hh h I = I h h : Ml |: :| | 

124 9 QEVFVH SGPNSQKAADVTYDSEVMGVAGCQEKESTEVQSLSLEEGEMETDVEKE 13 02 



76 9 NKE - KLSALPPEG 



GKPYLESFKLSLDNTK 



DTLLPDEVSTL 807 



Db 



1303 KRETKPEQVSEEGEQETAAPEHEGTYGKPVLTLDMPSSERGKALGSLGGSPSLPDQ 1358 



Qy 



8 08 SKKEKI PLQMEELSTAVYSNDDLFISKEAQIRETETFSDS- -SPI EI IDE- -FPTLISSK 863 

I I :h: III : - II h= II I = III 

1359 DKAGCI EVQVQSLDTTVTQTAEAV EKVI ETWI SETGESP - ECVGAHLLPAEKSSA 1413 



Db 



Qy 



864 TDSFSKLAREYTDLEVSHKSEIANAPDGAGSLPCTELPHDLSLKNIQPKVEEKISFS 920 

I I = : :|: | |:| | | : | :: :|| | 

1414 TGGHWTLQHAEDTVPLGPESQ AESI PI IVTPAPES- -TLHPDLQGEISASQRE 1464 



Db 



Qy 



921 



DDFSKNGSATSKVLLLPPDVSALGHTQAEIESIVKPKVLEKEAEKKLP 968 



Db 



1465 RSEEEDKPDAGPDADGKESTAIEKVLKAEPEILELESKSNKIVLNVIQTAVDQFARTETA 1524 



Qy 



969 SDTEKED 975 



Db 



1525 PETHAYD 1531 



RESULT 13 
5180808-2 

/Patent No. 5180808 

APPLICANT: RUOSLAHTI , ERKKI I. 

TITLE OF INVENTION: VERSICAN CORE PROTEIN, NUCLEIC ACID 
/SEQUENCES ENCODING THE SAME, NUCLEIC ACID PROBES, ANTI -VERSICAN 
/ANTIBODIES, AND METHODS OF DETECTING THE SAME 
NUMBER OF SEQUENCES: 4 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/07/44 1 , 179 
FILING DATE: 27-NOV-1989 
/SEQ ID NO: 2 : 

LENGTH: 24 0 9 
5180808-2 

Query Match 4.4%/ Score 263.5/ DB 6/ Length 2409; 

Best Local Similarity 20.1%/ Pred. No. 2.1e-07; 

Matches 253; Conservative 18 0/ Mismatches 479/ Indels 345/ Gaps 58; 

Qy 43 EDEDEDLEELEVLER- -KPAAGLSAAPVPTAPAAGAPL-MDFGNDFVPPAPRGPLPAAPP 99 

I : : I : : I I = I I I I II II || | : | 

Db 910 ENITQTSREI VISERLGEPNYGAEIRGFST GFPLEEDFSGDF REYSTVSHP 960 

Qy 100 VAPE RQPSWDPSPVSSTV PAPSPLSAAAVSPSKLP-EDDEP 139 

= 11 I II | ::| : I h II I I h 

Db 961 IAKEETVMMEGSGDAAFRDTQTSPSTVPTSVHISHISDSEGP-SSTMVSTSAFPWEEFTS 1019 

Qy 14 0 PARPPPPPPASVSPQAEPVWTPPAPAPAAPPSTPAAPKRRGSSGAWXXXXKIMDLKEQP 199 

I =11 II II I I |:: | 
Db 1020 SAEGSGEQLVTVSSSWPVL PS - -AVQKFSGTASSI I DEGL 1058 

Qy 200 GNTISAGQEDFPSVLLETAASXPSLS PLSAASFKEHEYLGNLSTVLP 246 

I : : I I : I I I = :h U : I Ml I 

Db 1059 GEVGTVNEIDRRSTILPTAEVEGTKAPVEK EEVKVSGTVSTNFPQTI EPAKLWSRQE 1115 

Qy 247 TEGTLQENVSEASKEVSEKAKTLLIDRDL TEFSELEYSEMGSSF 290 

: I : I I I I : I : : | || :|| : : 

Db 1116 VNPVRQE I ESETTSEEQI QEEKSFES PQNS PATEQTI FDSQTFTETELKTTDYSVLTTKK 1175 



Qy 2 91 SVSPKAESAVIVANPREE I IVKNKDEEEKLVS NNI LHXQQELPTALTKLVKED 343 

: I I 1 = 1= 11= I 

Db 1176 TYSDDKEM KEEDTSLWMSTPDPDANGLESYTTLPEATE KSH 1217 

Qy 344 EWSSEKAKDSFNEKRVAVEAPMREEYADFKPFERVWEVKDSKEDSDMLAAG-GKIESNL 4 02 

= = = =1 : I = = ! = = = I = I I = : | : : : | : | | | | 

Db 1218 FFLATALVTESIPAEHWTDSPIKKEEST-KHFPKGMRPTIQESDTELLFSGLGSGEEVL 1276 

Qy 4 03 ESKVDKKCFADSLEQTN HEKDSES-SNDDTSFPSTPEGIKDRSGAYITCAPFNPA 456 

■ ■ = 11 I I I I hi : I : I - 

Db 1277 PTLPTESVNFTEVEQINNTLYPHTSQVESTSSDKIEDFNRMENVAKEVGPLVSQTDIFEG 1336 

Qy 457 ATES I ATN I F PLLEDPTS EN XTD EKKI EEKKAQI VTEKNTS 497 

: I = : I I : I M = = 1 = : I Hh^ 

Db 1337 SGSVTSTTLIEILSDTGAEGPTVAPLPFSTDIGHPQNQTVRWAEEIQTSRPQTITEQDSN 1396 

Qy 4 98 TKTSNPFFVAAQDSETDYVTTDNLTKVTEEWANMPEGLTPDLVQEACESELNEVTGTKI 557 

= 1 I ll = = == =1 I = h II I || 

Db 1397 KNSSTAEINETTTSSTDFLARAYGFEMAKEFVTSAPK- - PSDLYYEPSGEGSGEV 1449 

Qy 558 AYETKMDLVQ TSEVMQESLYP AAQLCPSFEESEATPSPVLPDIVMEAP 605 

1 = 1 I III = = I ::| : | : : | 

Db 1450 DIVDSFHTSATTQATRQESSTTFVSDGSLEKHPEVPSAKAVTADGFPTVSVMLP 1503 

Qy 606 LNSAVPSAGASVIQPSSSPLEASSVNYESIKHEPENPPPYEEAMSVSLKVSGIKEEIKEP 665 

1 = 1 I III l = = = - :| ::: : | :| |:| 

Db 1504 LHSE QNKSSPDPTSTLS-NTVSYERSTDGSFQDRFR-EFEDSTLKPNRKKP 1552 

Qy 666 ENINAALQETEAPYI S I ACDLI KETK LSAEPAPDF-SDYSEMAKVEQPVPD 715 

M :| II II I | |: || : : := | 

Db 1553 TE NI I IDLDKEDKDLILTITESTILEILPELTSDKNTIIDIDHTKPV 1599 

Qy 716 HSELVE DSSPDSEPVDLFSDDSIPDVPQKQDETVMLVKESLTETSFESMIE 766 

h III I :|:| |||: | ||ll • I I : 

Db 1600 YEDI LGMQTDI DTEVPSEPHD - SNDESNDDSTQVQE I YEAAVNLSLTEETFEGSADVLAS 1658 

Qy 767 YENKEKL SALPPEGGKPYLESFKLSLDNTKDTLLPDEVS -T 806 

II- : I = =1 I : I III I 

Db 1659 YTQATHDESMTYEDRSQLDHMGFHFTTGIPAP STETELDVLLPTATSLP 1707 

Qy 807 LSKKEK- -IPLQMEELSTAVYSNDDLFISKEAQIRETETFSDSSPI EI IDEFPTLI 860 

: ; l N = = l = = Ihl 1=111 I III III 

Db 1708 I PRKSATVI P-EIEGIKAEAKALDDMF ESSTLSDGQAIADQSEI I PTL- 1754 

Qy 861 S S KTDS FS KLARE YTDLE VSHKSEIANAPDGAGSLPCTELPH 902 

I = Ml: I I - I ||:| 

D ^ 1755 GQFERTQEEYEDKKHAGPSFQPEFSSGAEEALVDHTPYLSIATTHLMDQSVTEVPD 1810 

Qy 903 DLSLKNI QPKVEEKI SFSDDFSKNGSAT- SKVLLLPPDVSALGHTQ 947 

' I : = = I hi I I I I : I llh 

Db 1811 VMEGSNPPYYTDTTLAVS -TFAKLSSQTPSSPLTI YSGSEASGHTEI PQPSALPGIDVGS 1869 

Qy 948 AEIESIVKP KVLEKEAEKKLPSDTE KEDRSPSAIFSADLG 987 

11= II = I I |h ::| | : : 

Db 1870 SVMSPQDSFKEIHVNIEATFKPSSEEYLHITEPPSLSPDTKLEPSEDDGKPELLEEMEAS 192 9 



Qy 



988 KTSW DLLYWRDI KKTGWFGASLFLLLSL TVFSI VSVTAYIALA 1032 

I - -I | | | | :: : :: :::: | | 

193 0 PTELI AVEGTE I LQDFQNKTDGQVSGEAI KMFPTI KTPEAGTVI TTADE I ELEGATQWPH 198 9 



Db 



Qy 



1033 LLSVTISFRI YKGVI QAI QKSDEGHP - FRAYL ESEVAISEELV 1074 



Db 



1990 STSASATYGVEAGWPWLSPQTSERPTLSSSPEINPETQAALIRGQDSTIAASEQQV 2 046 




RESULT 14 
US-08-769-309A-5 

; Sequence 5, Application US/08769309A 
; Patent No. 5741890 

; GENERAL INFORMATION: 

APPLICANT: Scott, John D . , 
APPLICANT: Nauert , Brian J., 
APPLICANT: Klauck, Theresa M. 

TITLE OF INVENTION: Protein Binding Domains of Gravin 
NUMBER OF SEQUENCES: 24 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: Marshall, O' Toole, Gerstein, Murray & Borun 

STREET: 6300 Sears Tower/233 South Wacker Drive 

CITY: Chicago 

STATE: Illinois 

COUNTRY: United States of America 

ZIP : 60606-6402 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 

COMPUTER: IBM PC compatible 

OPERATING SYSTEM: PC-DOS/MS-DOS 

SOFTWARE: Patentln Release #1.0, Version #1.30 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/08 /769 , 3 09A 

FILING DATE: 

CLASSIFICATION: 43 5 
ATTORNEY /AGENT INFORMATION: 

NAME: No. 5741890and, Greta E . 

REGISTRATION NUMBER: 35,302 

REFERENCE/DOCKET NUMBER: 27866/33451 
TELECOMMUNICATION INFORMATION: 

TELEPHONE: 312-4 74-63 00 

TELEFAX: 312-4 74-044 8 

TELEX: 25-3856 
INFORMATION FOR SEQ ID NO: 5: 
SEQUENCE CHARACTERISTICS: 

LENGTH: 178 0 amino acids 

TYPE: amino acid 

TOPOLOGY: linear 
MOLECULE TYPE: protein 
US-08-769-309A-5 

Query Match 4.3%; Score 256; DB 1; Length 178 0; 

Best Local Similarity 19.2%; Pred. No. 4e-07; 

Matches 221; Conservative 163; Mismatches 419; Indels 348; Gaps 45; 



QY 



1 MEDLDQSPLVSS- -SDSPPRPQPAFKYQFV 



REPEDEEEEEEEEE 42 



Db 775 LEEKSEDSIAGSGVEHSTPDTEPGKEESWVSIKKFI PGRRKKRPDGKQEQAPVEDAGPTG 834 

QY 4 3 -EDEDED LEELEVLERKPAAGLSA APVPTAPAAGAPLMDFGNDFV- - PPAP 90 

-II ||: :||: I I I || : | | 

Db 835 ANEDDSDVPAWPLSEYDAVEREKMEAQQAQKGAEQPEQKAATEVSKELSESQVHMMAAA 8 94 

Qy 91 RGPLPAAPPVAPERQPSWDPSPVS STVPAPSPLSAAAVSPSKLPEDDEPPARPPPPP 14 7 

I = II Ml = h I I = I I -Ml | | 

Db 895 VADGTRAATI I EERS PSW I SAS VTEPLEQVEAEAALLTEEVLERE VI AEEE PPTVTEPLP 954 

Qy 148 PASVSPQAEPVWTPPAPAPAAPPSTPAAPKRRGSSGAWXXXXKIMDLKEQPGNTISAGQ 207 

Db 955 ENREARGDTWSEA 968 

Qy 208 EDFPSVLLETAASXPSLSPLSAASFKEHEYLGNLSTVLPTEGTLQENVSEASKEVSEKAK 267 

I I : I II | | | : || : | :| ||: 

Db 969 ELTPEAV- -TAA ETAGPLGSEEGTEASAAEETTEMVSAVSQLTD 1010 

Qy 268 TLLIDRDLTEFSELE YSEMGSSFSVSPKAESAVI VANPREEI I - -VKN 313 

: I hi |: : : | || : | ::: | : 

Db 1011 SPDTTEEATPVQEVEGGVPDIEEQERRTQEVLQAVAEKVKEESQLPGTGGPEDVLQPVQR 107 0 

Qy 314 KDEEEKLVSNNILHXQQELPTALTKLVKEDEWSSEKAKDSFNE KRVAVEAPMRE 368 

: I : I I M H I |::= | Ml 
Db 1071 AEAE RPEEQAEASGLKKETDWLKVDAQEAKTEPFTQGKWGQTTP 1116 

Qy 369 EYADFKPFERVWEVKDSKEDSDMIA AGGKIESNLESKVDKKCFADSLE-QTNH 420 

= 11= =1 =| ||::: II I : | : = = ||:| |: 

Db 1117 ESFEKAPQVTES I ESSELVTTCQAETLAGVKSQ EMVMEQAI PPDSVETPTDS 1168 

Qy 421 EKBSESSNDDTSFPSTPE5X : GIKDRSGAYI TCAPFNPAATES I ATNI FPLLE 470 

I I : I I h | : : :: III 

Db 1169 ETDGSTPVADFDAPGTTQKDEIVEIHEENEVHLVPVRGTEAEAVPAQKERPPAPSSFVFQ 122 8 

Qy 4 71 DPTSENXTDEKKIEEKKAQIVTEKNTSTKTSNPFFVAAQDSETDYVTTDNLTKVT 530 

Db 122 9 EETKEQSKMEDTLEH TDKEVSVETVSIL SKTE- -GTQEADQYADEKTK 1274 

Qy 53 1 NMP- -EGLTPDLVQEACESELNEVTGTKIAYETKMDLVQTSEVMQESLYPAAQLCPSFEE 588 

: : I Ml | - M :: | :: | :|: 

Db 1275 DVPFFEGL EGS ID-- TG I TVSREKVTEVALKGEGTEEAECKKDDALELQSH 1323 

Qy 589 SEATPSPVLPDIVMEAPLNSAVPSAGASVIQPSSSPLEASSVNYESIKHEPE 640 

WW -I- - : | : || | ::|| 

Db 1324 AKSPPSPVEREMWQ VEREKTEAEPTHVNEEKLEHE1AVTVSEEVS 1369 

Qy 64i NPPP YEEAMSVSLKVSGIKEEIKEPENINAALQE 674 

= 111 llh ::| | 
Db 1370 KQLLQTVN VP 1 1 DGAKE VS SLEGS P P PCLGQE EAVCTKI Q V QS 1412 

Qy 675 TEAPYISIACDLIKETKLSAEPAPDFSDYSEMAKVEQPVPDHSELVEDSSPDSEPVDLF- 733 

: H : I =1 h I I = | : :| | | | || :| 

Db 1413 SEASFTLTAA- -AEEEKVLGETA NILETGETLEPAGAHLVLEEKSSEKNEDFAAHP 1466 

Qy 734 SDDSI PDVPQKQDETVMLVKESLTETSFESMI EYENKEKL 77^ 

I I - - : |: | :| | | 

Db 1467 GEDAVPTGPDCQAKSTPVIVSATTKKGLSSDLEGEKTTSLKWKSDEVDEQVACQEVKVSV 1526 



Qy 


774 


- -SALPPEGGKPYLE- -SFKLSLDNTKDTLLPDEVSTLSKKEKI PLQMEELSTAVYSNDD 


829 


Db 


1527 


1 1! 1 II 1 II : 1 1 : || || :| : ::: 

AIEDLEPENGILELETKSSKL-VQNIIQTAVDQFVRT EETATEMLTSE- 


1573 


Qy 


83 0 


LFISKEAQI RETETFSDSSPI EI IDEFPTLISSKTDSFSKLAREYTDL- -E 

i iii iii i 


878 


Db 


1574 


: H : : | | | : | | | = = : : : : : | : : 
--LQTQAHVI KADSQDAGQETEKEGEEPQASAQDETPITSAKEESESTAVGQAHSDISKD 


1631 


Qy 


879 


VSHKS E I ANAPDGAGS LPCTE LPHD- LSLKNI 


J U 


Db 


1632 


H II : II Ml =| | : : 
MSEASEKTMTVEVEGSTVNDQQLEEWLPSEEEGGGAGTKSVPEDDGHALLiAERIEKSLV 


1691 


Qy 


910 


QPKVEEKISFSDDFSKNGSATSKVLLLPPDVSALGHTQAEIESIVKPKVLEKE 


962 






=11 HI II II : IN | || ||hom 




Db 


1692 


EPKEDEKGDDVDDPENQNSALA DTDASGGLTKESPDTNGPKQKEKEDAQEVEL 


1744 


Qy 


963 


AEKKLPSDTEK 973 




Db 


1745 


QEGKVHSESDK 1755 




RESULT 


15 






US-08- 


994-570- 


-5 





Sequence 5 # Application US/08994570 
Patent No. 6090929 
GENERAL INFORMATION: 

APPLICANT: Scott , John D., 

APPLICANT: Nauert , Brian J., 

APPLICANT: Klauck, Theresa M. 

TITLE OF INVENTION: Protein Binding Domains of Gravin 
NUMBER OF SEQUENCES: 24 
CORRESPONDENCE ADDRESS : 

ADDRESSEE: Marshall, 0' Toole, Gerstein, Murray & Borun 

STREET: 6300 Sears Tower/233 South Wacker Drive 

CITY: Chicago 

STATE: Illinois 

COUNTRY: United States of America 

ZIP: 60606-6402 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 

COMPUTER: IBM PC compatible 

OPERATING SYSTEM: PC-DOS/MS -DOS 

SOFTWARE: Patent In Release #1.0, Version #1.30 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/08/994,570 

FILING DATE: 

CLASSIFICATION: 
ATTORNEY/AGENT INFORMATION: 

NAME: No. 6090929and, Greta E. 

REGISTRATION NUMBER: 35,3 02 

REFERENCE/DOCKET NUMBER: 27866/33451 
TELECOMMUNICATION INFORMATION: 

TELEPHONE : 312-474-6300 

TELEFAX: 312-474-0448 

TELEX: 25-3856 
INFORMATION FOR SEQ ID NO: 5: 



SEQUENCE CHARACTERISTICS: 
LENGTH: 178 0 amino acids 
TYPE: amino acid 
TOPOLOGY: linear 
MOLECULE TYPE: protein 
US-08-994-570-5 

Query Match 4.3%; Score 256; DB 3; Length 1780; 

Best Local Similarity 19.2%; Pred. No. 4e-07; 

Matches 221; Conservative 163; Mismatches 419; Indels 348; Gaps 45; 

Qy 1 MEDLDQSPLVSS- -SDSPPRPQPAFKYQFV REPEDEEEEEEEEE 42 

Db 775 LEEKSEDSIAGSGVEHSTPDTEPGKEESWVSIKKFIPGRRKKRPDGKQEQAPVEDAGPTG 834 

Qy 43 -EDEDED LEELEVLERKPAAGLSA APVPTAPAAGAPLMDFGNDFV- - PPAP 90 

-II I I : :|h I I I || : | | 

Db 83 5 ANEDDSDVPAWPLSEYDAVEREKMEAQQAQKGAEQPEQKAATEVSKELSESQVHMMAAA 894 

Qy 91 RGPLPAAPPVAPERQPSWDPSPVS STVPAPSPLSAAAVSPSKLPEDDEPPARPPPPP 147 

I ■ M IN = h I I : | I -Ml I I 

Db 895 VADGTRAATI I EERSPSWI SASVTEPLEQVEAEAALLTEEVLEREVI AEEEPPTVTEPLP 954 

Qy 148 PASVSPQAEPVWTPPAPAPAAPPSTPAAPKRRGSSGAWXXXXKIMDLKEQPGNTISAGQ 207 

Db 955 ENREARGDTWSEA 968 

Qy 2 08 EDFPSVLLETAASXPSLSPLSAASFKEHEYLGNLST^ 267 

Db 969 ELTPEAV- -TAA ETAGPLGSEEGTEASAAEETTEMVSAVSQLTD 1010 

Qy 268 TLLI DRDLTEFSELE YSEMGSSFSVSPKAESAVI VANPREEI I - - VKN 313 

Db 1011 SPDTTEEATPVQEVEGGVPDIEEQERRTQEVXQAVAEKVKEESQLPGTGGPEDVLQPVQR 1070 

Qy 314 KDEEEKLVSNNILHXQQELPTALTKLVKEDEWSSEKAKDSFNE KRVAVEAPMRE 368 

: I = I : I I I = I I 1 = :: I III 
Db 1071 AEAE RPEEQAEASGLKKETDWLKVDAQEAKTEPFTQGKWGQTTP 1116 

Qy 369 E YADFKPFERVWE VKDS KEDSDMLA AGGKIESNLESKVDKKCFADSLE-QTNH 420 

: 11= : I =1 I l = = : III: I ||:| |: 

Db HI 7 ESFEKAPQVTESIESSELVTTCQAETLAGVKSQ EMVMEQAI PPDSVETPTDS 1168 

Qy 421 EKDSESSNDDTSFPSTPE GIKDRSGAYI TCAPFNPAATES I ATNI FPLLE 470 

I I : I I h I : : :: I | || I 

Db 1169 ETDGSTPVADFDAPGTTQKDEIVEIHEENEVHLVPVRGTEAEAVPAQKERPPAPSSFVFQ 1228 

Qy 471 DPTSENXTDEKKIEEKKAQIVTEKNTSTKTSNPFFVAAQDSETDYVTTDNLTKVTEEWA 530 

■ I I I : I hi I :| : hh | : :| 

Db 1229 EETKEQSKMEDTLEH TDKE VS VETVS I L SKTE- -GTQEADQYADEKTK 1274 

Qy 531 NMP- -EGLTPDLVQEACESELNEVTGTKIAYETKMDLVQTSEVMQESLYPAAQLCPSFEE 588 

-I Ml I - || :: | == | :|: 

Db 1275 DVPFFEGL EGS ID- - TGI TVSREKVTEVALKGEGTEEAECKKDDALELQSH 1323 

Qy 589 SEATPSPVLPDIVMEAPLNSAVPSAGASVIQPSSSPLEASSVNYESIKHEPE 640 

MM - I : : - : | = || | ::|| 



Db 1324 AKSPPSPVEREMWQ VEREKTEAEPTHVNEEKLEHETAVTVSEEVS 1369 

Qy 641 NPPP YEEAMSVSLKVSGI KEE I KEPENI NAALQE 674 

= 111 |||: -I | 

Db 1370 KQLLQTVN VP 1 1 DGAKEVS S LEGS P P PCLGQE EAVCTKI Q V QS 1412 

QY 675 TEAPYISIACDLIKETKLSAEPAPDFSDYSEMAKVEQPVPDHSELVEDSSPDSEPVDLF- 733 

: I I : I =1 1= II : I : :| I I I II :| 

Db 1413 SEASFTLTAA- -AEEEKVLGETA NILETGETLEPAGAHLVLEEKSSEKNEDFAAHP 1466 

Qy 734 SDDS I PDVPQKQDETVMLVKESLTETS FESM I E YENKEKL 773 

:|-| | | :: :: : |: | :| | | 

Db 14 67 GEDAVPTGPDCQAKSTPVIVSATTKKGLSSDLEGEKTTSLKWKSDEVDEQVACQEVKVSV 1526 

QY 774 - -SALPPEGGKPYLE- -SFKLSLDNTKDTLLPDEVSTLSKKEKI PLQMEELSTAVYSNDD 829 

I II I II I || : | | : || || :| : ::: 

Db 1527 AIEDLEPENGILELETKSSKL-VQNI IQTAVDQFVRT EETATEMLTSE- 1573 

Qy 83 0 LFISKEAQI RETETFSDSSPIEI IDEFPTLISSKTDSFSKLAREYTDL- -E 878 

= =1 ■ :|ll : II | : : : : : ::|: : 

Db 1574 - -LQTQAHVI KADSQDAGQETEKEGEEPQASAQDETPITSAKEESESTAVGQAHSDI SKD 1631 

Qy 879 VSHKSEIANAPDGAGS LPCTE LPHD LSLKNI 909 

= 1 M : M II I : I I : : 

Db 1632 MSEASEKTMTVEVEGSTVNDQQLEEWLPSEEEGGGAGTKSVPEDDGHALLAERIEKSLV 1691 

Qy 910 QPKVEEKISFSDDFSKNGSATSKVLLLPPDVSALGHTQAEIESIVKPKVLEKE 962 

: 1 1 : I I M II = I I I I M III 

Db 1692 EPKEDEKGDDVDDPENQNSALA DTDASGGLTKESPDTNGPKQKEKEDAQEVEL 1744 

Qy 963 AEKKLPSDTEK 973 

I h h-l 
Db 174 5 QEGKVHSESDK 1755 

Search completed: January 22, 2004, 16:34:54 
Job time : 33.5792 sees 



GenCore version 5.1.6 
Copyright (c) 1993 - 2004 Compugen Ltd. 



OM protein - protein search, using sw model 



Run on: January 22, 2004, 16:31:15 ; Search time 32.1516 Seconds 

(without alignments) 
3523.514 Million cell updates/sec 

Title: US-09-830-972-29 
Perfect score: 5923 

Sequence: 1 MEDLDQSPLVSSSDSPPRPQ VKDAMAKI QAKI PGLKRKAE 1178 



Scoring table: BLOSUM62 

Gapop 10.0 , Gapext 0.5 



Searched: 283308 seqs, 96168682 residues 

Total number of hits satisfying chosen parameters: 283308 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 



Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 



Database : PIR_76:* 
1: pirl:* 
2: pir2:* 
3: pir3:* 
4: pir4:* 



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 

SUMMARIES 

% 

Result Query 

No. Score Match Length DB ID Description 



1 


787. 5 


13 


.3 


776 


2 


A46583 


neuroendocrine- spe 


2 


688 


11 


6 


208 


2 


160904 


neuroendocrine- spe 


3 


671 


11 


3 


267 


2 


A60021 


tropomyosin-relate 


4 


528 


8 


9 


2484 


2 


T26216 


hypothetical prote 


5 


519. 5 


8 


8 


2607 


2 


T26215 
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ALIGNMENTS 



RESULT 1 
A46583 

neuroendocrine-specif ic protein, splice form A - human 

N; Contains: neuroendocrine-specif ic protein, spl | ce form B 

C; Species: Homo sapiens (man) 

C;Date: 24-May-1996 #sequence_revision 24-May-1996 #text_change 05-Nov-1999 
C;Accession: A46583; 160903 

R;Roebroek, A.J.; van de Velde, H.J.; Van Bokhoven, A.; Broers, J.L.; Ramaekers, 
F . C . ; Van de Ven , W.J. 

J. Biol. Chem. 268, 13439-13447, 1993 

A; Title: Cloning and expression of alternative transcripts of a novel 
neuroendocrine-specif ic gene and identification of its 135-kDa translat ional 
product . 

A; Reference number: A46583; MUID : 93293865 ; PMID:7685762 
A; Access ion: A4 6583 

A,-Status: preliminary; translated from GB/EMBL/DDBJ 
A; Molecule type: mRNA 
A;ResidueS: 1-776 <R0E1> 

A; Cross -references : GB:L10333; NID : g307306 ; PIDN : AAA59950 . 1 ; PID:g307307 
A; Accession : 1609 03 



A; Status : preliminary; translated from GB/EMBL/DDBJ 
A; Molecule type: mRNA 
A;Residues: 421-776 <ROE2> 

A; Cross-references : GB:L10334; NID : g307308 ; PIDN : AAA59951 . 1 ; PID:g307309 
C;Genetics : 

A; Gene: GDB : RTN1 ; NSP 

A; Cross-references : GDB: 2 03 968 ; OMIM : 600865 
A ; Map position: 14q21-14q22 

Query Match 13.3%; Score 787.5; DB 2; Length 776; 

Best Local Similarity 31.9%; Pred. No. 3.1e-25; 

Matches 229; Conservative 91; Mismatches 190; Indels 207; Gaps 25; 

Qy 588 ESEATPSPVLPDI - -VMEAPLNSA VPSAGASVIQPSSSPLE- - -ASSVNY 632 

■■ ■ I h h I = : Ih I : I 

Db 141 EELGTPGPSLPDVPGIESRGLFSSDSGIEMTPAESTEVNKILADPLDQMKAEAYKYIDIT 200 

QY 633 --ESIKHEPENPPPYEEA MSVSLKVSGIKEEIKEPENINAAL QET 675 

I Mh == I h =|:| |::| I I = = : | 

Db 201 RPEEVKHQEQHHPELEDKDLDFKNKDTDI S I KPEGVREPDK- PAPVEGKI I KDHLLEEST 25 9 

Qy 676 EAPYISIACDL IKETKLSAEPAPDFSDYSEMAKVEQPVPDHSELVED 722 

I I I I II :| | Ih : :: |: :: 

Db 260 FAPYID DLSEEQRRAPQITTPVKI TLTEIEPSVE TTTQEKTPEKQDI CLK 309 

QY 723 SSPDSEPVDLFS DDS 1 PDVPQKQ DETVMLVKESLTETSF 761 

Mh I I III I : | || : :||: 
Db 310 PSPDTVPTVTVSEPEDDSPGSI TPPSSGTEPSAAESQGKGS ISEDELITAI KEA 363 

QY 762 ESMI EYENKEKLSALPPEGGKPYLESFKLSLDMTKDTLLPDEVSTLSKKEKI PLQMEELS 821 

: II I : :| Mill::: 
Db 364 - KGLS YETAENPRPVGQLADRP EVKARSGPPTI PSPLDHEA 403 

QY 822 TAVYSNDDLFISKEAQIRETETFSDSSPIEIIDEFP TLISSKTDSFS 868 

: : ' I I II- I I : I I I 

Db 404 SSAESGD S E I ELVS EDPMAAEDALPSG Y VS FGHVGG P P PS P 444 

QY 869 KLAREYTDLEVSHKSEIANAPDGAGSLPCTELPHDLSLKNIQPKVEEKISFSD 921 

= II = h = I = = | : | | | | | I 

Db 445 ASPS I QYS I LREEREAELDSELI I ESCDASSAS EESPKREQDSPPMKPSALD 496 

QY 922 DF SKNGSATSKVLL L P PD VSALGHTQAE I E S I VKP 956 

1= M I Ml M I |= : | 

Db 497 AIREETGVRAEERAPSRRGLAEPGSFLDYPSTEPQPGPELPPGDGAL- EPETPMLP 551 

QY 957 KVLEKEAEKKLPSDTEKEDRSPSA 1 FSADLGKTS WDLLYWRDI K 1001 

: 1 : ::|hl :| 'I =111111111 
°b 552 RKPEEDSSSNQSPAATKGPGPLGPGAPPPLLF LNKQKAIDLLYWRDIK 599 

Qy 1002 KTGWFGASLFLLLSLTVFSI VSVTAYIALALLSVTI SFRI YKGVI QAIQKSDEGHPFRA 1061 

= IMIh I II III IhlM I hi 1 1 II Mill I II hlhlhl 1 1111 = 1 

Db 600 QTGIVFGSFLLLLFSLTQFSWSWAYLALAALSATISFRIYKSVLQAVQKTDEGHPFKA 659 

Qy 1062 YLESEVAI SEELVQKYSNSALGHVNCTI KELRRLFLVDDLVDSLKFAVLMWVFTYVGALF 1121 

_ Ml 1= =1 = 1 =111 = = =11 hi III Ml II 1111111111111= III Ml | 

Db 660 YLELEITLSQEQIQKYTDCLQFYVNSTLKELRRLFLVQDLVDSLKFAVLMWLLTYVGALF 719 



Qy 1122 NGLTLL I LALI SLFS VPVI YERHQAQI DHYLGLANKNVKDAMAKI QAKI PGLKRKAE 1178 

I I I I I I - h h I - I h I MMMI I I I I = = Mill II I I 
Db 72 0 NGLTLLLMAWSMFTLPWYVKHQAQIDQYLGLVRTHINAWAKIQAKIPGAKRHAE 776 



RESULT 2 
160904 

neuroendocrine-specif ic protein C - human 
C; Species: Homo sapiens (man) 

C;Date: 24-May-1996 #sequence_revision 24 -May- 1996 #text_change 05-Nov-1999 
C; Access ion: 160904 

R;Roebroek, A.J. ; van de Velde, H.J.; Van Bokhoven, A.; Broers, J.L.; Ramaekers, 
F.C. ; Van de Ven, W.J. 

J. Biol. Chem. 268, 13439-13447, 1993 

A; Title: Cloning and expression of alternative transcripts of a novel 
neuroendocrine-specif ic gene and identification of its 135-kDa translational 
product . 

A; Reference number: A46583; MUID : 93293865 ; PMID : 7685762 
A; Access ion: 160904 

A; Status: preliminary; translated from GB/EMBL/DDBJ 

A; Molecule type: E TKL 
A;Residues: 1-208 <RES> 

A;Cross-references : GB:L10335; NID:g307310; PIDN : AAA59952 . 1 ; PID:g307311 
C;Genetics : 

A; Gene: GDB : RTN1 ; NSP 

A; Cross-references : GDB:203968; OMIM: 600865 
A ; Map position: 14q21-14q22 



Query Match 11.6%; Score 688; DB 2; Length 208; 

Best Local Similarity 67.5%; Pred. No. 6.2e-22; 

Matches 129; Conservative 32; Mismatches 30; Indels 0; Gaps 0; 



Qy 


988 


KTS WDLLYWRD I KKTG WFGAS LFLLLS LTVFS I VS VTA Y I ALALLS VT I S FR I YKG VI 


1047 


Db 


18 


h :|lllllllhlhll|: 1 || Ml ||:||| ||:||| || f 1 I I ( 1 1 1 h 
KSQAI DLLYWRDI KQTGI VFGSFLLLLFSLTQFS WSWAYLALAALSATI SFRI YKSVL 


77 


Qy 


1048 


QAIQKSDEGHPFRAYLESEVAISEELVQKYSNSALGHVNCTIKELRRLFLVnDLVDSLKF 


1107 


Db 


78 


Ihlhlllllhllll h :hl 1 h :|| hlllllllll IIIIIIM 

QAVQKTDEGHPFKAYLELEITLSQEQIQKYTDCLQFYVNSTLKELRRLFLVQDLVDSLKF 


137 


Qy 


1108 


AVLMWVFTlC5raEXTDLEV*p7ABXSLFSVPVIYERHQAQIDHYLGLANKNVKDAMAKIQ 
Mllh ! , :|||||| Mil :|||| 


1167 


Db 


138 


AVLMWLLTYVGALFNGLTLLLMAWSMFTLPWYVKHQAQI DQYLGLVRTH I NAWAKI Q 


197 


Qy 


1168 


AKI PGLKRKAE 1178 




Db 


198 


Mill M M 

AKIPGAKRHAE 208 





RESULT 3 
A60021 

tropomyosin- related protein, neuronal - rat 
C;Species: Rattus norvegicus (Norway rat) 

C;Date: 03-Mar-1993 #sequence_revision 03-Mar-1993 #text_change 05-Nov-1999 

C; Access ion: A60021 

R;Wieczorek, D.F.; Hughes, S.R. 

Brain Res. Mol . Brain Res. 10, 33-41, 1991 



A;Title: Developmental ly regulated cDNA expressed exclusively in neural tissue. 
A;Reference number: A60021; MUID: 91278684 ; PMID: 1647480 
A /Accession : A60021 
A; Molecule type; mRNA 
A;Residues: 1-267 <WIE> 

A; Cross-references : EMBL: X52817 ; NID:g456549; PIDN : CAA37001 . 1 ; PID:g456550 
C;Comment: This neuronal -specif ic mRNA was identified by hybridization to an 
alpha -tropomyosin probe but does not show homology in amino acid sequence. 

Query Match 11.3%; Score 671; DB 2; Length 267; 

Best Local Similarity 66.3%; Pred. No. 4.4e-21; 

Matches 124; Conservative 34; Mismatches 29; Indels 0; Gaps 0; 



Qy 988 KTSWDLLYWRDIKKTGWFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVI 1047 

h :|IMIIMhl|:|||: I || ||| , ||:||| || ||MIIII h 

Db 9 KSQAIDLLYWRDIKQTGIVFGSFLLLLFSLTQFSWSWAYLALAALSATISFRIYKSVL 68 

Qy 1048 QAIQKSDEGHPFRAYLESEVAISEELVQKYSNSALGHVNCTIKELRRLFLVDDLVDSLKF 1107 

IHhlllllhllll h :h| :|||:: :|| hlMIIIMI llllllll 

Db 69 QAVQKTDEGHPFKAYLELEITLSQEQIQKYTDCLQLYVNSTLKELRRLFLVQDLVDSLKF 128 

Qy 1108 AVLMWVFTYVGALFNGLTLLI LALI SLFSVPVI YERHQAQI DHYLGLANKNVKDAMAKI Q 1167 

I I' I I Li I I h ■ I : M l 'I :: I 

Db 12 9 AVLMWLLTYVGALFNGLTLLLMAWSMFTLPWYVKHQAQVDQYLGLVRTHINTWAKIQ 188 

Qy 1168 AKIPGLK 1174 

Mill : 

Db 189 AKIPGAR 195 



RESULT 4 
T26216 

hypothetical protein W06A7.3c - Caenorhabditis elegans 
C; Species: Caenorhabditis elegans 

C;Date: 15-Oct-1999 #sequence__revision 15-Oct-1999 #text_change 15-Oct-1999 
C;Accession: T26216 
R;Ains cough, R. 

submitted to the EMBL Data Library, August 1996 
A; Reference number: Z2 0173 
A; Access ion: T26216 

A; Status : preliminary; translated from GB/EMBL/DDB J 
A;Molecule type: DNA 
A;ResidueS: 1-2484 <WIL> 

A; Cross-references: EMBL:Z78066; PIDN : CAB51467 . 1 ; GSPDB : GN0 0023 ; CESP : W06A7 . 3c 

A; Experimental source: clone W06A7 

C; Genetics : 

A ; Gene : CESP : W0 6A7 . 3 c 

A; Map position: 5 

A;Introns: 1827/1; 1866/3; 1963/3; 1990/3; 2262/1; 2289/1; 2339/2; 2463/2 

Query Match 8.9%; Score 528; DB 2; Length 2484; 

Best Local Similarity 20.9%; Pred. No. 7.4e-14; 

Matches 277; Conservative 186; Mismatches 461; Indels 402; Gaps 54; 

Qy 1 MEDLDQSPLVSSSDSPPRPQPAFKYQFVREPEDEEE EE 38 

= 1-1 I : I I I hi |= =| 

Db 13 97 LEKVEVQP- -DLSQNSPAPHKI IDLHF-NI PKDHEDYGNDYVPFGTESSEESQKADGNQE 14 53 



39 EEEEED EDEDLEELEVLERKPA AGLSAAP 67 

HIM Mh h h I : h 

14 54 NQEEEDWAELNFHPI RQWRDEDVI SLQSLKSLVAEVGCI TDVDASDVNEQDEESTLKI L 1513 

68 - -VPTAPAAGAPLMDFGNDFVPPAPRGPLPAAPPVAP- -ERQPSW 108 

lh h :|| II I hi I || 

1514 KWPSEPSLLE- -LDFTND- -PKVIHVPI PLME PATMYLE EMVEW I IADAVKEVSEMEW 1569 

109 DPSPVSSTVPAPSPLSAAAVSPSKLP-EDDEPPARPPPPPPASVSPQAEPVW 159 

I M I I II : I I I MM I I I | : h 

1570 TESEISEMAPQVSESTCPIPEPL ADLKLPVEDDEKTPEPEPWPGQVQERI I PI E 1624 

160 TPPAPA - PAAP PSTPAAPKRRGSSGAWXXXXKI MDLKEQPGNTI SAGQEDFPS VL - - LE 216 
UE M I II I | | : 

1625 VEQAPTI PQRPPRAP KSELPKVAKPLD 1651 

217 TAASXPSLSPLSA ASFKEHEYLGNLSTVLPTEGTLQENVSEASKEVSEKAKTLL 270 

1652 DSKSRVRFAPLNIKLGRTYSEEQQKELVESLERPL-TI ITQQKPPEKPTEDIG- -ALSPL 1708 

271 IDRDLTEFSELEYSEMGSSFSVSPKAESAVIVANPREEI I VKNKDEEEKLVSNNILHXOO 330 

I \ \ : I I : MM I II : M I 
1709 S PNTLAEYEE VPMMDMQS VPHSPQE KQEEIEALSEI I 1745 

331 ELPTALTKLVKEDEWSSEKAKDSFNEKRVAVEAPMREEYADFK---PFERVWEVKDSKE 387 

1746 EEPQAMKEVEKPVE-SAPEKDNESLEAPEI -INEPIRRVLVETKIMGPGKSLNE^ 1803 

'388 D-SDMLAAGGKIESNLESKVDKKCFADSLEQTNHEKDSESS NDDTSFPSTPEGIKD 442 

I 1= I : I : | : : | h: : Ml II I | : |l*50 

1804 DGSECLDS IGDLS ERTI QR- - FNTS I DDPS I RRDSFSS I SSFGDRQKFRTAI ENI RQ 1858 

443 RSGAYITCAPFNPAATESIATNIFPLLEDPTSENXTDEKKIEEKKAQI VTEKNTSTKTSN 502 

M I : Ml 
1859 DLLPFQSSVSQYLRSSPNP SQQLLVTN 1885 

503 PFFVAAQDSETDYVTTDNLTKVTEEWANMPEGLTPDLVQEACESELNEVTGTKIAYETK 562 
Ml :| = I III | | : : : | :: |: :| 

1886 LSMDSPSD- -LSPNAPPVGFENTAQFLEKLQQE-DRPSAEGSIDSSGFEKVDHE- - 1936 

563 MDLVQTSEVMQESLYPAAQLCPSFEESEATPSPVLPDIVMEAPLNSAVPSAGASVIQPSS 622 

MM I : | | - | : ::\ \ 

193 7 GLDEFAAPP VHDPMQKSVFGSLGSDDMKPGS 1967 

623 SP LEASSVNYESI KHEPE AMSVSLKVSGIKgEI KEPEHI^T AAL 672 

M ^ I I : : hi == | | ||| 

1968 QDDGFVF I ERNEAN EATLKKNQKMSSHHNDVI EKNYFNDNAPTAAL 2013 

673 QETEAPYI S I ACDLI KETKLSAEPAPDFSDYSEMA KVEQ P VPDHS EL 719 

IM || = :: || hi : I Ml = I 

2014 - -LESPIAEEARKLVQDAVESA S E YKKQAVDSGDE I GRELLDNVEQKI EQVKEP 2065 

72 0 VEDS S PDSE PVDLFS DDS I PDVPQKQDETVMLVKESLTETS FESM I E YENKEKLSALP PE 779 
: II : I | M h I - I h M || 
2066 IVDSL- -HKAYDGVGDFVHETVPNAVDDFVREAEKQLPESPVPEKIE TPE 2113 



Qy 


780 


GGKPYLESFKLSLDNTKDTL- -LPDEVSTLSKKEKI PLQMEELSTAVYSNDDLFISKEAQ 


837 


Db 


2114 


PLVDIHDTVDKVHDEVDNFLRREPTP-PFETDDVAPLSDDKPQFGNQTP 


2161 


Qy 


838 


I RETETFSDSSPI EI I DEFPTLI SSKTDSFSKLAREYTDLEVSHKSEI ANAPDGAGSLPC 


897 


Db 


2162 


: M | : | : | : : : : : : | | : | | | : 
EEDETTFDRKGPLTI PEEVE KAAAAQNNDLD DFDPLVTSNTGAAFGAAVGAAAA- - 


2215 


Qy 


898 


TELPHDLSLKNIQPKVEEKISFSDDFSKNGSATSKVLLLPPDVSALGHTQAEIESIVKPK 


957 


Db 


2216 


| | : : || : | : : : | 
VESLTEEEM FGH- -QKFETVPRPP 


2237 


Qy 


958 




989 


Db 


2238 


| | | | | | : : | | || 
T PPKDI SDEDVKPSTV- - -NLGPSHHHSHPSS PHHS I LKHHGDAWI DFKTVP 


2286 


Qy 


990 


-S WDLLYWRDI KKTGWFGASLFLLLSLTVFSI VS VTAYI ALALLS VTI SFRI YKGVI Q 


1048 


Db 


2287 


M-MM ||: :| :| :| | : :::| | | | || :: | | 
PCVLDVI YWRDAKKS A I VLSLALLVLFVLAKY PLLT WT YSLLLALGAAAGFR VFKKVEA 


2346 


Qy 


1049 


AI QKSDEGHPFRAYLESEVAI SEELVQKYSNSALGHVNCTI KELRRLFLVDDLVDSLKFA 
1:1:1 Ml | : : • • 1 1 • • * 1 1 . | . . 1 1 . . . 1 - 1 1 


1108 


Db 


2347 


III III 1 ■■••II • • -| | • | - - | | • • • I • I | 
QIKKTDSEHPFSEILAQDLTLPQEKVHAQADVFVEHATCIANKLKKLVFVESPLESIKFG 


2406 


Qy 


1109 


VLMWVFTYVGALFNGLTLLI LALI SLFSVPVI YERHQAQI DHYLGLANKNVKDAMAKI QA 
: : : 1 11 = = hi II II h =1111 HI :| || :| : ::|: | 


1168 


Db 


2407 


LVLWSLTYIASWFSGFTLAILGLLGVFSVPKVYESNQEAIDPHLATISGHLKNVQNIIDE 


2466 


Qy 


1169 


KIPGLK 1174 




Db 


2467 


KLPFLR 24 72 





RESULT 5 
T26215 

hypothetical protein W06A7.3a - Caenorhabditis elegans 
C;Species: Caenorhabditis elegans 

C;Date: 15-Oct-1999 #sequence_revision 15-Oct-1999 #text_change 15-Oct-1999 
C;Accession: T26215 
R;Ainscough, R. 

submitted to the EMBL Data Library, August 1996 
A; Reference number: Z2 0173 
A /Accession: T26215 

A;Status: preliminary; translated from GB / EMBL/DDB J 
A; Molecule type: DNA 
A;Residues: 1-2607 <WIL> 

A; Cross -references: EMBL:Z78066; PIDN : CAB01522 . 2 ; GSPDB : GN00023 ; CESP : W06A7 . 3a 

A; Experimental source: clone W06A7 

C;Genetics : 

A; Gene : CESP : W06A7 . 3a 

A; Map position: 5 

A;Introns: 1827/1; 1866/3; 1963/3; 1990/3; 2262/1; 2289/1; 2412/1; 2462/2; 
2586/2 

Query Match 8.8%; Score 519.5; DB 2; Length 2607; 

Best Local Similarity 20.4%; Pred. No. 1.8e-13; 

Matches 292; Conservative 175; Mismatches 474; Indels 491; Gaps 55; 



Qy 1 MEDLDQSPLVSSSDSPPRPQPAFKYQFVREPEDEEE EE 3 8 

:| - I I = I I I hi h :| 

Db 1397 LEKVEVQP- -DLSQNSPAPHKI IDLHF-NI PKDHEDYGNDYVPFGTESSEESQKADGNQE 1453 

Qy 3 9 EEEEED EDEDLEELEVLERKPA AGLSAAP 67 

HIM |||: h h I : h 

Db 1454 NQEEEDWAELNFHPIRQWRDEDVISLQSLKSLVAEVGCITDVDASDVNEQDEESTLKIL 1513 

Qy 68 - -VPTAPAAGAPLMDFGNDFVPPAPRGPLPAAPPVAP- -ERQPSW 108 

Ih h :|| III hi I || 

Db 1514 KWPSEPSLLE- -LDFTND- - PKVIHVPI PLMEPATMYLEEMVEWI IADAVKEVSEMEW 1569 

Qy 109 DPSPVSSTVPAPSPLSAAAVSPSKLP-EDDEPPARPPPPPPASVSPQAEPVW 159 

I II I I II = mOttW I I I d| |: 

Db 1570 TESEISEMAPQVSESTCPIPEPL ADLKLPVEDDEKTPEPEPWPGQVQERI I PI E 1624 

Qy 160 TPPAPA-PAAPPSTP AAPKRRGSSGAWXXXXKIMDLKEQPGNTISAGQE 208 

M I II I II I 
Db 1625 VEQAPTI PQRPPRAPKSELPKVAKPLDDSKS RVRFAPLNI KLGRTYSEEQQKEL 1678 

Qy 209 DFPSVLLETAASXP SLSPLSAASFKEHEY- -LGNLSTVLPTEGTLQEN 254 

Db 1679 VESLERPLTI I -TQQKPPEKPTEDIGALSPLSPNTLAEYEEVPMMDMQSVPHSPQEKQEE 1737 

Qy 255 VSEASKEVSEKAKTLLIDRDLTEFSELEYSEMGSSFSVSPKAESAVIVANPREEI IVK- - 312 

■ II h h : : hi = : |: h I -h 

Db 1738 I -EALSEI IEEPQAM KEVEKPVESAPEKDNESLEAPEI INEPIRRVLVETK 1787 

Qy 313 NKDEEEKLVSNNILHXQQELPTALTKLVKEDEWSSEKAKDSFNEKRVAVEA 3 64 

5X | : | : : 2 675*: : | ||: || 
Db 1788 IMGPGKSLNEDNDDDDDGSECLDS IGDL SERTIQRFN TSIDD 1829 

Qy 365 P- -MREEYADFKPFERVWEVKDSKED- -SDMLAAGGKIESNLESKVDKKCFADSLEQTNH 420 

Db 1830 PS I RRDSFSS I SSFGDRQKFRTAI ENI RQDLLPFQSSVSQYLRSS PNP SQQLLVTNL 1886 

QY 421 EKDSESSNDDTSFPSTPEGI KDRSGA- - YITCAPFNPAATESIATNIF 466 

II I I I : I I :|| | | : | | : 

Db 1887 SMDSPS DLSPNAPPVGFENTAQFLEKLQQEDRPSAEGSIDSSGFEKVDHEGLDEFAA 1943 

Qy 467 PLLEDPTSEN XTDEKK 1 EEKKAQ I VTEKNTSTKTSNPFFVAAQDS 511 

I - II - = h I II :| | | =|: | :: 

Db 1944 PPVHDPMQKSVFGSLGSDDMKPGSQDDGFVFIERNEANEATLKKNQKMSSHHNDVIEKNY 2003 

Qy 512 ETDYVTTDNLTKVTEEWANMPEGL CES ELNEV- TGTKI AYE TKMBfrV 566n 

II = I =1 I I h I I : I H :| I 

Db 2004 FNDNAPT AALLESPIAEEARKLVQDAVESASEYKKQAVDSGDEIGRELLDNVEQKIE 2060 

Qy 567 QTSEVMQESLYPAAQLCPSFEESEATPSPVLPDIVMEAPLNSAVPSAGASVIQPSSSPLE 626 

I I : Hh I I I h I I I II 
Db 2061 QVKEPIVDSLHKAYDGVGDFVH-ETVPNAV-DDFVREAE 2097 

Qy 627 ASSVNYESIKHEPENPPPYEEAMSVSLKVSGIKEEIKEPENINAALQETEAPYISIACDL 686 

I Ihl I hh II I : I : 

Db 2098 KQLPESPVP EKIETPE P LVD I HDTV 2122 



Qy 687 IK ETKLSAEPAPDFSDYSEMAKVEQPVPDHSELVEDSSPDSEPVDLFSDDSIPDV 741 

I : I II I I :| :| |: | : : 
Db 2123 DKVHDEVDNFLRREPTPPFE TDDVAPLSDDKPQFGNQT 2160 

Qy 742 PQKQDETVMLVKESLTETSFESMIEYENKEKLSALPPEGGKPYLESFKLSLDNTKDTLLP 8 01 

-II | || :| 
Db 2161 - PEEDETTFDRKGPLT IP 2177 

Qy 802 DEVSTLSKKEKIPLQMEELSTAVYSNDDLFISKEAQIRETETFSDSSPIEI IDEFPTLIS 861 

= 11 I ■ I HI :hl I- 

Db 2178 EEV EKAAAAQNND LDDFDPLVT 2199 

Qy 862 SKT DSFSKLAR EYTDLEVSHKSEIANAPD 890 

II I : I : :| :| | : | 

Db 22 00 SNTGAAFGAAVGAAAAVESLTEEEMFGHQKFETVPRPPTPPKDI SDEDVK- PSTVNLGPS 22 58 

Qy 891 GAGSLPCTELPHD LSLKNIQPKVEEKIS FSDDFS 924 

I I : M : I : I : i |= || 

Db 2259 HHHSHPSS- -PHHSILKHHGDAWIDFKTVPPCAQNAFSPGEIMFLLAFFVYLSCFASFFS 2316 

Qy 925 KNGSATS KVLLL PPDVSALG 944 

\- =1 I =1 II 

Db 2317 KSLPLLDNLLSLWYLSISLIIHVKHHRKFRWNEEQATTMSKLGAVGRGLYALIAFIVNI 2376 

Qy 945 - - HTQAE I ES I VKPKVLEKEAEKKLPSDTEKEDRS PSA I FSADLGKTS WDLLYWRD I KK 1002 

= =1 I II I I III |:|::|||| II 

Db 2377 VLRVGLNVALWGVAVSAHEAYKLTKS SGVLRKKEVLDVI YWRDAKK 2423 

Qy 1003 TGVVFGASLFLLLSLTVFSI VSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRAY 1062 

: : I : l : l I : : : I I I I ll = =l I hhl III 

Db 2424 SAIVLSLALLVLFVIAKYPLLTVVTYSLLLALGAAAGFRVFKKVEAQIKKTDSEHPFSEI 2483 

Qy 1063 LESEVAI SEELVQKYSNSALGHVNCTI KELRRLFLVDDLVDSLKFAVLMWVFTYVGALFN 1122 

I : : : : I I - : I I :|:=| h ::|:|| = = = I lh : h 
Db 2484 liAQDLTLPQEKVHAQADVFVEHATCIANKLKKLVFVESPLESIKFGLVLWSLTYIASWFS 2543 

Qy 1123 GLTLLI LALI SLFSVPVI YERHQAQI DHYLGLANKNVKDAMAKIQAKI PGLK 1174 

I M II h HIM :|| :| || :| : ::|: | |:| 

Db 2544 GFTLAI LGLLGVFSVPKVYESNQEAI DPHLATI SGHLKNVQNI I DEJCLPFLR 2 595 

RESULT 6 
T26213 

hypothetical protein W06A7.3b - Caenorhabditis elegans 
C; Species: Caenorhabditis elegans 

C;Date: 15-Oct-1999 #sequence_revis ion 15-Oct-1999 #text_change 15-Oct-1999 
C;Accession: T26213 
R;Ains cough, R. 

submitted to the EMBL Data Library, August 1996 
A/Reference number: Z20173 
A; Accession : T2 6213 

A; Status: preliminary; translated from GB/EMBL/DDBJ 
A;Molecule type: DNA 
A;Residues: 1-222 <WIL> 

A;Cross-references : EMBL:Z78066; PIDN : CAB01523 . 1 ; GSPDB : GN00023 ; CESP : W06A7 . 3b 
A; Experimental source: clone W06A7 
C; Genetics : 



A; Gene: CESP : W06A7 . 3b 
A; Map position: 5 
A;Introns: 27/1; 77/2; 201/2 



Query Match 5.7%; Score 34 0.5; DB 2; Length 222; 

Best Local Similarity 32.4%; Pred. No. 1.4e-07; 

Matches 66; Conservative 52; Mismatches 83; Indels 3; Gaps 2; 

IKEDRS PSAI FSADL - GKTS WDLLYWRD I KKTG WFGASLFLLLSLTVFS I VSVTAY I A 103 0 
I : h = = I II ::|::|lll Ih : I = I = | I : = : = | I 

:kyskqptwvpatdfpgk- - ILDVI YWRDAKKSAI VLSLALLA^FVIAKYPLLTWTYSL 66 

ALLSVTI SFRI YKGVI QAI QKSDEGHPFRAYLESEVAI SEELVQKYSNSALGHVNCTI K 1090 

I I I : I I hhl III | - : :| | :: : | | 

jLALGAAAGFRVF KKVEAQ I KKTDS EHP FS E I LAQDLTLPQE KVHAQAD VF VEHATC I AN 126 

ILRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGLTLLILALISLFSVPVIYERHQAQIDH 1150 
|::f |: UCfS|*25Xl| | | : : | : | | | XD |||| :|| :| || 

LKKLVFVESPLESIKFGLVLWSLTYIASWFSGFTLAILGLLGVFSVPKVYESNQEAIDP 18 6 

'LGLANKNVKDAMAKI QAKI PGLK 1174 
I : = = h I hi h 



Qy 


972 


Db 


9 


Qy 


1031 


Db 


67 


Qy 


1091 


Db 


127 


Qy 


1151 


Db 


187 


RESULT 


7 


138346 




elastic 


titin 



C; Species: Homo sapiens (man) 

C;Date: 29-May-1998 #sequence_revision 29-May-1998 #text_change 21-Jul-2000 

C;Accession: 138346 

R ; Label t, S.; Kolmerer, B. 

Science 270, 293-296, 1995 

A; Title: Titins : giant proteins in charge of muscle ultrastructure and 
elasticity. 

A; Reference number: A57430; MUID: 96026330; PMID:7569978 
A; Access ion: 13 834 6 

A;Status: preliminary; translated from GB/EMBL/DDBJ 
A; Molecule type: mRNA 
A;Residues: 1-7962 <RES> 

A; Cross -references : EMBL:X90569; NID : gl017426 ; PIDN : CAA6218 9 . 1 ; PID:gl017427 

C;Genetics: 

A; Gene: GDB : TTN 

A; Cross-references : GDB: 127867; OMIM: 188840 
A ; Map position: -2q31 

Query Match 5.6%; Score 331; DB 2; Length 7962; 

Best Local Similarity 20.0%; Pred. No. 4.7e-05; 

Matches 235; Conservative 144; Mismatches 404; Indels 392; Gaps 48; 

Qy 32 EDEEEEEEEEEEDEDEDLEELE- -VLERK PAAGLSAAPV PTAPA 73 

hi Mill I i hi hi I : : | | IH 

Db 6176 EEEREEEEEAEVTEYEVMEEPEEYWEEKLHIISKRVEAEPAEVTERQEKKIVLKPKIPA 6235 

Qy 74 AGAPLMDFGNDFVPPAPRGPLPAAP-PVAPERQPSWDPSPVSSTVPAPSPLSAAAVSPSK 132 

: I h II = Ih: I I I I I III 

Db 6236 K IEEPPPAKVPEAPKKI VPEKK VPAPVP-KKEKVPPPK 6272 



Qy 133 LPEDDEPPA RPPP PPPASVS PQAEPVWTPPAPAPAAPPS-TPAA 175 

MM : I =11 lllh IN M : I 

Db 6273 VPEEPKKPVPEKKVPPKVIKMEEPLPAKVTEKHMQITQEEKVLVAVTKKEAPPKARVPEE 6332 

QY 176 PKRRGSSGAWXXXXKIMDLK EQPGNTI SAGQEDF PSVLLETAASXPS -LSPL 227 

III I- II hi - - | :| | : : 

Db 6333 PKR AVPEEKVLKLKPKREEEPPAKVTEFRKRWKEEKVSIEAPKREPQPIKEV 6385 

Qy 228 SAASFKEHEYLGNLSTVLPTEGTLQEN - VSEIASKEVSEKAKTLLI DRDLTEFSELEYSEM 286 

M I Ihl II :| |: : : I || I | :| 

Db 6386 TIMEEKERAY TLEEEAVSVQREEEYEEYE EYDYKEFEEYEPTEE 6429 

Qy 287 GSSFS VSPKAESAVI V- -ANPREEI 309 

POTE Ml | : | | : : 

Db 6430 YDQYEEYEEREYERYEEHEEYITEPEKPI PVKPVPEEPVPTKPKAPPAKVLKKAVPEEKV 6489 

Qy 310 IV KNKDEEEKLVSNNILHXQQELPTALTKLVKEDEWSSEKAKDSFNEKRVAVEAP 365 

I M 1= :::::: || :| | |: || ||| | 

Db 6490 PVPI PKKLKPPPPKVPEEPKKVFEEKIHISITK- -REKEQVTEPAAKVPMKPKRVVAEEK 6547 

Qy 366 MREEYADFKPFERVWEVKDSKEDSDMLAAGGKIESNLESKVDKKCFADSLEQTNHEKDSE 425 

= I M II I - | : : |: : : |: ||:: | 

Db 6548 VPVPRKEVAPPVRVPEVPKELEPEEV AFEEE WTHVE - EYLVEEEEEYI HEEE - E 6600 

Qy 426 SSNDDTSFPSTPEGI KDRSGAYI TCAPFNPAATES IATNI FPLLEDPTSENXTDEKKI EE 485 

I I : : III I | :: || || 

Db 6601 FITEEEWPVIPVKVPE VPRKPVPEEKKPVPVPKKKEAPPAKVPEVPKKPEE 6652 

Qy 486 KKAQIVTEKNTSTKTSNPFF VAAQDS ETD YVTTDNL 521 

I | |P$*p950XA| : 

Db 6653 KVPVLIPKKEKPPPAKVPEVPKKPVPEEKVPVPVPKKVEAPPAKVPEVPKKPVPEKKVPV 6712 

Qy 522 TKVTEEWANMPEGLTPDLVQEACESELNEVTGTKIAYETKMDLVQTSEV 571 

Ml : I I | | : | : : | : | | || 

Db 6713 PAPKKVEAPPAKVPEVPKKLIPEEKKPTPVPKKVEAPPPKVPKKREPVPVPVALPQEEEV 6772 

Qy 572 M-QESLYPAAQLCPSFEESEATP 593 

: : I : I - I IN I 
Db 6773 LFEEEIVPEEEVLP- -EEEEVLPEEEEVLPEEEEVLPEEEEIPPEEEEVPPEEEYVPEEE 6830 

QY 594 SPVLPDIVMEAPLNSAVPSAGASVIQ-- PSSSPLEASSVNYESI - 635 

III- : h : II | : h | | : | 

Db 6831 EFVPEEEVLPEVKPKVPVPAPVPEI KKKVTEKKWI PKKEEAPPAKVPEVPKKVEEKRI I 6890 

Qy 836 --KHE PENPPPYEEAMSVy EI KEPENIN 669 

II MM M Ih : II | 

Db 68 91 LPKEEEVLPVEVTEEPEEEPISEEEI PEEPPSIEEV EEVAPPRVPEVIK 693 9 

Qy 670 AALQETEAPYISIACDLIKETKLSAEPAPDFSDYSEMAKVEQPVPDHSELVEDSSPDSEP 729 

Mil MM MM :M I == I 

Db 6940 KAVPEAPTPV PKKVEAPP AKVSKKI PEEKVPVPVQKKEAPP 6980 

Qy 73 0 VDLFSDDSIPDVPQKQDETVMLVKESLTETSFESMIEYENKEKLSALPPEGGKPYLESFK 78 9 

: M I M I I Ml I MM M 

Db 6981 A KVPEVPKKVPEKKVLV P KKEAVP PAKGR 7009 



Qy 790 LSLDNTKDTLLPDEVSTLSKKEKI PLQMEELSTAVYSNDDLFISKEAQIRETETFSDSSP 849 

hi ::|| ::| : : || :: | :| = | | : : 

Db 7010 TVLEEKVSVAFRQEWVKERLELEWEAEVEE- - 1 PEEEEFHEVEEYFEEGE 7059 

Qy 850 IEI IDEFPTLISSKTDSFSKLAREYTDLEV- -SHKSEIANAPDGAGSLPCTELPHDLSLK 907 

-II I : : :: : : :|| : : |: | | ::| | 

Db 7060 FHEVEEFI KLEQHRVEEEHRVEKVHRVI EVFEAEEVEVFEKPKAPPKGP EISEK 7113 

Qy 908 NIQPK VEEKISFSDDFSKNGSATSKVLLLPPDVSAL 943 

I II lllh == HI 
Db 7114 IIPPKKPPTKWPRKEPPAKVPEVPKKIWEEKVRVPEE PRVPP 7157 

Qy 944 GHTQAE I ES I VKPKVLEKEAEKKLPSDTEKEDRSP 978 

:: :: || | |||:| |: :| 

Db 7158 TKVPEVLPPK- -EWPEKKVPVPPAKKPEAP 7 86 



RESULT 8 
T13564 

microtubule-associated protein homolog - fruit fly (Drosophila melanogaster) 
N;Alternate names: hypothetical protein EG:49E4.1 
C; Species: Drosophila melanogaster 

C;Date: 13-Aug-1999 #sequence_revision 13-Aug-1999 #text_change 17-Nov-2000 
C; Access ion: T13 564 

R;Spanos, L. ; Papagiannakis , G.; Siden-Kiamos , I.; Louis, C. 
submitted to the EMBL Data Library, April 1999 

A/Description: Sequencing the distal X chromosome of Drosophila melanogaster. 
A; Reference number: Z1768 9 
A; Accession : T13 564 

A;Status: preliminary; translated from GB/EMBL/DDBJ 
A; Molecule type: DNA 
A;Residues: 1-5327 <SPA> 

A /Cross-references : EMBL: AL03 1128 ; PIDN : CAA2 0006 . 1 
C; Genetics : 

A; Cross-references : FlyBase : FBgn0025392 

A;Introns: 24/2; 52/3; 104/3; 179/1; 232/1; 1669/3; 2566/1; 4798/3; 5272/1 
A; Note: EG:4 9E4.1 

C; Superfamily : Drosophila 576K microtubule-associated protein homolog 

Query Match 5.5%; Score 325.5; DB 2; Length 5327; 

Best Local Similarity 23.6%; Pred. No. 4.6e-05; 

Matches 275; Conservative 151; Mismatches 520; Indels 219; Gaps 53; 



Qy 


28 


VREPEDEEEEEEEEEE DEDEDLEELEVLERKPAAGLS AAPVPTAPAAG 


75 


Db 


1277 


' ■ :hll | = : 1 1 1 : 1 1 : - | | | : 
MEQVKDKEEHEQKIESGI ITEKEAKKSASTPEEKETSDITSDDELPAQLADPTTVPPKSA 


1336 


Qy 


76 


APLMDFGNDFVPPA PRGPLPAAPPV-APERQPSWDPSPVSSTVPAPSPLSAA 


126 


Db 


1337 


1 1= II 1 II Ml : ||::| | 1 
KDREDTGSIESPPTIEEAIEVEVQAKQEAQKPVPAPEEAIKTEKSPLASK-ETSRPESAT 


1395 


Qy 


127 


AVSPSKLPEDDEPPARPPPPPPASVSPQAEPVWTPPAPAPAAPPSTPA APK 


177 


Db 


1396 


= 111 1 h :h H 1 h 1 : I I 

GSVKEDTEQTKSKKSPVPSRPESEAKDKKSPFASGEASRPESVAESVKDEAGKA 


1451 


Qy 


178 


RRGS SGAWXXXXKI MDLKEQPGNTI SAGQE DFPS VLLETAASXP - SLS PLSA 


229 



II III h: 



Db 1452 RRESIAKTHKDESSLDKAKEQESRRESLAESIKPESGIDEKSALASKEASRPESVTDKS- 1510 

Qy 230 ASFKEHEYLGNLSTVLPTEGTLQENVSEASKEVSEKAKTLLIDRDLTEFSELEYSEMGSS 289 

M = = = I I I I : III I : =| || | = 

Db 1511 KEPSRRESIAESLKAESTKDEKSAPPSKEASRPGSWESVKDETEKSKEPSRRESIA 1567 

Qy 290 FSVSPKAESAVI VANPREEI I VKNKDEEEKLVSNN 1 LHXQQELPTALTKLVKEDEW 346 

I I I h I I H Mill = : | : : : | | : : : 

Db 1568 ESAKPPIEFRE-VSRP-ESVIDGIKDESAKPESRRDSPLASKEASRPESVLESVKDEPIK 1625 

Qy 347 SSEKAK DSFNEKRVAVE - APMREEYADFKPFERVWEVKDS - - KEDSDMIAAGGKI 398 

Ml- : I I I =h = U I I l : III =11 = 

Db 1626 STEKSRRESVAESFKADSTKDEKS PLTSKDI S - RPESAVENVMDAPFKETSRPESAVGSM 1684 

Qy 399 ESNLESKVDKKCFADSLEQ™KEp?ReSES3HDDTSFP-STPEGIKDRSGAYIT( 455 

M I = I II = : =11 I I I II II | 

Db 1685 KDESMSK EPSRRESVKDGAAQSRETSRPASVAESAKD- -GADDLKELSRP 1732 

Qy 456 AATE SIATNIFPLLEDPTS ENXTD- -EKKIEEKKAQI VTEKN- -TSTK 4 99 

: I M II : I h I II II : : | lh | : 

Db 1733 ESTTQSKEAGSIKDEKSPLASEEASRPASVAESVKDEAEKSKEESRRESVAEKSPLPSKE 1792 

Qy 500 TSNPFFVAAQDSETDYVTTDNLTKVTEEWANMP EGLTPDLVQEACESELNEVTGT 555 

II II : I I II I I I I I- I 

Db 1793 ASRPASVA- - ES I KDEAEKSKEESRRESVAEKSPLPSKEASRPASVAESI KDE AE 1845 

Qy 556 KIAYETKMDLVQTSEVM- -QESLYPAAQ 612 

Db 1846 KSKEESRRESVAEKSPLPSKEASRPASVAESIKDEAEKSKEESRRESVAEKSPL^ 1905 

Qy 613 SPLEASSVNYESIKHEPENFPpYEEAfilE^SLKVSGIKEEI KEPENIN 669 

III ' M -I I :|| : Ihl II 

Db 1906 RPASVAESIKDEAEKSKEESRRESVAEKSPLPSKEASRPASVAESIKDEAEKSKEESRRE 1965 

Qy 670 AALQETEAPYISIACDLIKETKLSAEPAPDFSDYSEMAKVEQPVPDHSELVEDSSPDSEP 729 

: : : = I II II I : I = I I I I : II 

Db 1966 SVAEKSPLP SKEASRPASVAESI KDEAEKSKEE SRRES VAEKS P 2009 

Qy 73 0 VDLFSDDS I PDVPQKQDETVMLVKESLTETS FESM I E - - YENKEKLSALP - PEGGKP - - Y 784 

: I I : | ||: : : :| | |: : | || | :| 

Db 2010 LPSKEASRPASVAES I KD EAEKSKEESRRES VAEKS PLPSKEASRPASV 2058 

Qy. 785 L E S F KLS LDNTKDTLL P DE VS TLS KKE KIP LQM E E LSTAVYSNDDLFISKEAQIRE 840 

M I = H =1 I II II =| II |= IN | 

Db 2059 AES I KDEAEKSK EESRRESAAEKSPLPSKEASRPASVAESVKDEADKSKEESRR- 2112 

QY 841 TETFSDSSPIEI I - -DEFPTLISSKTDSFSKLAREYTDLEVSHKSEIANAPDGAGSLPCT 8 98 

I : = = l : I I : I h H H III): |||: 

Dt> 2113 - ESMAESGKAQS I KGDQSPLKEVSRPES VAESVKDDPVKSK- EPSRRESVAGSVTAD 2167 

Qy 899 ELPHDLSLKNI QPKVEEK 1 SFSDDFSKNGSATSKVLLLPPDVSALGHTQAE 94 9 

I - I =1 I Ihl 

Db 2168 SARDDQS PLES KGASRPES WDS VKDEAEKQES RRESK 22 05 

Qy 950 IESIVKPKVLEKEAEKKL PSDTEKEDR - S PSA I FSADLGKTS WDLL YW RD I 1000 

M:= M ^ - I- :=| =|| I h : h = : || 

Db 2206 TES VI PPKAKDDKSPKEVLQPVSMTETI REDADQPMKPSQAESRRES I AES I KASSPRDE 2265 



Qy 1001 KKTGWFGASLFLLLSLTVFS I VSVTAYI ALALLSVTI SFRI YKGVI QAI QKSDEGHPFR 1060 

I M II = =1 = : I I II I I I 

Db 2266 KSP LASKEASRPGSV AES I KYDLDKPQ 1 1 KDDKSTE - HSRR 23 05 

Qy 1061 AYLESEVAI -SEELVQKYSNSALGH 1084 

Db 23 06 ESLEDKSAVTSEKSVSRPLSVASDH 233 0 



RESULT 9 
A47282 

calcium-binding protein calphotin - fruit fly (Drosophila melanogaster) 
C; Species: Drosophila melanogaster 

C;Date: 21-Sep-1993 #sequence_revision 18-Nov-1994 #text_change 21-Jul-2000 
C; Access ion: A4 7282 

R;Martin, J.H.; Benzer, S.; Rudnicka, M. ; Miller, C.A. 
Proc. Natl. Acad. Sci. U.S.A. 90, 1531-1535, 1993 

A; Title: Calphotin: a Drosophila photoreceptor cell calcium-binding protein. 
A/Reference number: A47282; MUID : 93165729 ; PMID:8094559 
'A; Accession: A4 71; 
A; Status : prel iminary 
A;Molecule type: nucleic acid 
A/Residues : 1-865 <MAR> 

A;Cross-references : GB:L02111; NID:gl57031; PIDN : AAA28405 . 1 ; PID:gl57032 
A; Experimental source: photoreceptor cells 

A;Note: sequence extracted from NCBI backbone (NCBIN: 124955 , NCBI P : 124956) 

C;Genetics: 

A; Gene : FlyBase : Cpn 

A; Cross-references : FlyBase : FBgn0010218 

C; Super family: collagen alpha 1(1) chain; fibrillar collagen carboxyl -terminal 
homology; von Willebrand factor type C repeat homology 
C;Keywords: calcium binding 

Query Match 5.5%; Score 324.5; DB 2; Length 865; 

Best Local Similarity 22.3%; Pred. No. 4e-06; 

Matches 229; Conservative 128; Mismatches 375; Indels 297; Gaps 49; 

Qy 5 9 PAAGLSAAPVPTAPAAGAPLMDFGNDFVPPAPRGPLPAAPPVAPERQPSWDPSPVSSTVP 118 

I ^ I I I I I I Ih I Ml hi | |: | |:| 

Db 9 PVSAPVAAPV-TPSAVAAPVQWSPAAVAPAP APIAVTPVAPPPTLASVQPATV- - TIP 65 

Qy 119 APSPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPVWTPPAP APAAPP--ST 172 

I h I : : I h h I II I l!h || : || I || ||| 

Db 66 APAPIAAASVTP VASVAPPWAAPTPPAA-SPVSTPVAVAQIPVAVSAPVAPPVAAT 121 

Qy 173 PAAPKRRGSSGAWXXXXKIMDLKEQPGNTISAGQEDFPSVLLETAASXPSLSPLSAASF 232 

I : : I = I II h III :|h h 

Db 122 PTPWQIPVAAPVIAT PPVAASA PT PAAVTPVISPVIAS- - 160 

Qy 233 KEHEYLGNLSTVLPTEGTLQENVSEASKEVSEKAKTLLIDRDLTEFSELEYSEMGSSFSV 2 92 

1 = 1 I | h : | | :| 
Db 161 PPWPANTT VPVAAPVAAVPAAVPWAPVLAP AV 194 

Qy 293 SPKAESAVIVAN PREEI I VKNKDEEEKLVSNNILHXQQELPTALTKLVKEDEWS 347 

H I :|| I II I :| : |: 2 ||: 

Db 195 APAV- -APWAETPAPPP VAEI P VAT IPECVAPLIPEVSWA 234 



Qy 348 SEKAKDSFNEKRVAVEAPMREEYADFKPFERVWEVKDSKEDSDMLAAGGKIESNLESKVD 4 07 

: I Mill I | :: :|: : : | 

Db 23 5 T KPLAAAE P VWAP PATET PWAPAAASPHVSVAPAVETAWAPVS 2 80 

Qy 408 KKCFADSLEQTNHEKDSESSNDDTSFPSTPEGIKDRSGAYITCAPFNPAATESIATNIFP 467 

II = : I =| II II 
Db 281 ASTEPPV AAATLTTAPETPAL 3 01 

Qy 468 LLEDPTSENXTDEKKIEEKKAQIVTEKNTSTKTSNPFFVAAQDSETDYVTTDNLTKVTEE 527 

I H I III I | 

Db 3 02 APWAESQ VAA NTWATPP 32 0 

Qy 528 WANMPEGLTPDLVQEACESELNEVTGTKIAYETKMDLVQTSEVMQESLYPA 579 

I I I : I : I : I II M II H I lh I I 

Db 321 TPAPEPETIAPPW AETPEVASVAVA-ETTPPW- -PPVAAESI -PAPWATTPV 371 

Qy 580 -AQLCPSFEESEATPSPVLPDIVMEAPLNSAVPSAGASVIQPSSSPLEASSVNYESIKHE 638 

I I = = h I M == =1= III = I |: I | 

Db 3 72 PATLAVTDPDVTASAVPELPPVIAPSPVPSAVAETPVDLAPPVLPPVAAEPVPAWAEET 431 

Qy 639 PENPPPYEEAMSV- SLKVSGI KEEI KEPENINAALQETEAPYI SI ACDLI KETKLSAEPA 697 

M I I :| : : | | : | || :| : :| || 

Db 432 PETPAPASAPVTIAALDIPEVAPVIAAPSDAPAEAPSAAAPI VSTPPTTASVPETTAPPA 491 

Qy 698 P DFSDYSEMAKVEQPVPDHSEL VEDSSPDSEPVDLFSDDSIP- -DVPQK 744 

11111=111 h I | :| || : | :| 

Db 492 AVPTEPIDVSVLSE-AAIETPVAPPVEVTTEVAVADVAPPEAAADLIIEPVEPPAPIPDL 550 

Qy 745 QDETVMLVKESLTETSFESMIEYENKEKLSALPPEGGKPYLESFKLSLDNTKDTLLPDEV 804 
: : I : I : :P : : | | | : : : : $ 60 

Db 551 LEQTTSVPAVEAAESTSSPIPE TSLPPPNEAVASPEVAVAPITAPEPIPEPEP 603 

Qy 805 STLSKKEKI PLQ MEELSTAVYSNDDLFISKEAQIRETETFSDSSPIEI IDEFPTLI 860 

c I = I ll = = -I II I : II I III ||| = 

Db 6 04 SLATPTEPIPVEAPWIQEAVDAV EVPVTETST SI P-ETTVEFPEAV 64 9 

Qy 861 SSKTDSFSKLAREYTDLEV-SHKSEIANAPDGAGSLPCTELP HDLSL 906 

- I I 1= I : = -II III Mh I-: 

Db 650 AEKV LDPAITEAPVTTQEPDVANINDGA PAT E I TT PAVE I VTAAAE VS D I A I 701 

Qy 907 KNIQPKVEEKISFSDDFSKNGSATSKVLL LP PDVSALG 944 

I I I -I: : : : : ::|:: :| |:|| 

Db 702 PVIDPPVPQEIAVA-EIPETDTKPAEVIVEQSTIPIEAPVPEVSKYAEPVISEAPAAEVP 760 

Qy 945 HTQAE I ES I VKPKVLEKEAEKKLPSDTEKE QBOTFSADLGKTSWDLLY 996 

: I I =1 I = II h h - IN | : | : || 

Db 761 ITAGDNPDNTSVGISEW-PTIAEKPVEEVPTSEIPEQSSSPSD- -SVPVAK- -ITPLL- 814 

Qy 997 WRDIKKTGV 1005 

lh= I I 

Db 815 -RDLQTTDV 822 

RESULT 10 
A47283 

calphotin - fruit fly (Drosophila melanogaster) 



C; Species: Drosophila melanogaster 

C;Date: 21-Sep-1993 #sequence_revision 25-Apr-1997 #text_change 21-Jul-2000 
C; Access ion : A4 7283 

R;Ballinger, D.G.; Xue, N. ; Harshman, K.D. 

Proc. Natl. Acad. Sci . U.S.A. 90, 1536-1540, 1993 

A;Title: A Drosophila photoreceptor cell-specific protein, calphotin, binds 
calcium and contains a leucine zipper. 

A;Reference number: A47283; MUID : 93 165730 ; PMID : 8434 015 

A; Contents: photoreceptor cells 

A; Access ion: A472 83 

A; Status : prel iminary 

A;Molecule type: nucleic acid 

A;Residues: 1-873 <BAL> 

A; Cross-references : GB:L05080; NID:gl57071; PIDN : AAA28420 . 1 ; PID:gl57072 
A;Note: sequence extracted from NCBI backbone (NCBIN : 124958 , NCBIP : 124959 ) 
C-Superfamily: collagen alpha 1{I) chain; fibrillar collagen carboxyl -terminal 
homology; von Willebrand factor type C repeat homology 

Query Match 5.4%; Score 32 0.5; DB 2; Le 

Best Local Similarity 22.5%; Pred. No. 6e-06; 

Matches 235; Conservative 122; Mismatches 367; Indels 321; Gaps 50; 

Qy 59 PAAGLSAAPV-PTAPAAGAPLMDFGNDFVPPAPRGPLPAAP PVAPERQPSW-DPSP 112 

I = I I I I hi M = = I II I I I I I I I I I h | 

Db 9 PVSAPVAAPVTPSAVAAPVQWSPAAVAVAPAWAPAPAAPTAVTPVAP- - PPTLASVQP 66 

DEGQ 2330 

Qy 113 VSSTVPAPSPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPVWTPP A 163 

= llllhl-lhhl | | ||| | || ||| 

Db 67 ATVTVPAPAPIAAASVAP VASVAPPWAAPTPPA ASPVSTPPVAVAQI PVAV 118 

Qy 164 PAPAAPP STPAAPKRRGSSGAWXXXXKIMDLKEQPGNTISAGQEDFPSVLLET-- 217 

II III II II ::| | | 

Db 119 SAPVAPPVAATPTPVAP 1 PVAAPVIATPPVAASAPT 154 

QY 218 -AASXPSLSPLSAASFKEHEYLGNLSTV: ENVS EAS KE VS EKAltCTLL 1 DRDL/t 276 

II I --W-- I hi I I h : I - | 

Db 155 PAAVTPWSPVIAT PPWPANTT VPVAAPVAAVPAAVPWAPVLA 199 

Qy 277 EFSELEYSEMGSSFSVSPKAESAVI VAN PREEI I VKNKDEEEKLVSNNILHXQQE 331 

= 1 = 1 I =11 I II I 
Db 200 P AVAPAV- -APWAETPAPPPVAEI PVAT 226 

Qy 332 LPTALTKLVKEDEWSSEKAKDSFNEKRVAVEAPMREEYADFKPFERVWEVKDSKEDSDM 3 91 

= 1 = 1= I ll = = I =1111 1 | 

Db 227 IPECVAPLIPEVSWAT KPLAAAEPVWAPPATET PWAPAAASPH 272 

Qy 3 92 LAAGGKIESNLESKVDKKCFADSLEQTNHEKDSESSNDDTSFPSTPEGIKDRSGAYITCA 451 

: I : : : I M = =1=11 
Db 273 VSVAPAVETAWAPVS ASTEPPV AAATLTTA 3 02 

Qy 452 PFNPAATESIATNIFPLLEDPTSENXTDEKKIEEKKAQIVTEKNTSTKTSNPFFVAAQDS 511 

IN I H I III 
Db 304 PET PAL APWAESQ VAA 320 

QY 512 ETDYVTTDNLTKVTEEWANMPEGLTPDLVQEACESELNEVTGTKIAYETKMDLVQTSEV 571 

I I I II : I :| I =1 II :| I 

Db 321 NTWATPPTPAPEPETIAPPW AETPEVASVAVA-ETTPPW- -PPV 364 



Qy 


572 


MQESLYPA AQLCPSFEESEATPSPVLPDIVMEAPLNSAVPSAGASVIQPSS 


622 


Db 


365 


AAESI - PAPWATTPVPATliAVTDPDVTASAVPELPPVIAPSPVPSAVAETPVDLAPPVL 


423 


Qy 


623 


SPLEASSVNYESIKHEPENPPPYEEAMSV-SLKVSGIKEEIKEPENINAALQETEAPYIS 


681 


Db 


424 


h 1 1 =1111 ::: :| : : | | : | || :| 

PPVAAEPVPAWAEETPETPAPASAPVTIAALDI PEVAPVIAAPSDAPAEAPSAAAPIVS 


483 


Qy 


682 


I ACDLI KETKLSAEPAP DFSDYSEMAKVEQPVPDHSEL VEDSSPDSEPV 


730 


Db 


484 


-II 1 1 M 1 :| || h || \ 

TPPTTASVPETTAPPAAVPTEPIDVSVLSE-AAIETPVAPPVEVTTEVAVADVAPPEAAE 


542 


Qy 


731 


DLFSDDSIP- -DVPQKQDETVMLVKESLTETSFESMIEYENKEKLSALPPEGGKPYLESF 


788 


Db 


3 


II = 1 H -I : h= = 1 -Ml 

DLIIEPVEPPAPIPDLLEQTTSVPAVEAAESTSSPIPE TSLTPfiBNEHJXASPEV 


595 


Qy 


789 


KLSLDNTKDTLLPDEVSTLSKKEKI PLQ MEELSTAVYSNDDLFISKEAQIRETETF 


844 


Db 


596 


■ ■ 1 1 : 1 II- -1 II 1 : II I 

AVAPITAPEPIPEPEPSLATPTEPIPVEAPWIQEAVDAV EVPVTETST 


644 


Qy 


845 


SDSSPIEI IDEFPTLISSKTDSFSKLAREYTDLEV-SHKSEIANAPDGAGSLPCTELP- - 


901 


Db 


645 


III | |: | : : ::|| ||| | ||: 

--SIP-ETTVEFPEAVAEKV LDPAITEAPVTTQEPDVANINDGA PATEITTP 


693 


Qy 


902 


HDLSLKNIQPKVEEKISFSDDFSKNGSATSKVLL LP PDVS 


941 


Db 


694 


AVEIVTAAAEVSDTAIPLIDPPVPQEIAVA-EIPETETKPAEVIVEQSTIPIEAPVPEVS 


752 


Qy 


942 


ALG HTQAEIESIVKPKVLEKEAEKKLPSDTEKEDRSPSA 


980 


Db 


753 


H 1 =| 1 : II h h - III 
KYAEPVISEAPAAEVPITAGDNPDNTSVGISEW-PTIAEKPVEEVPTSEI PEQSSSPSD 


811 


Qy 


981 


I FSADLGKTS WDLLYWRD I KKTGV 1005 




Db 


812 


- - S VP VAK- - I TPLL - - RDLQTTDV 830 





RESULT 11 
T34418 

hypothetical protein F12F3.3 - Caenorhabdit is elegans 
C; Species: Caenorhabdit is elegans 

C;Date: 29-Oct-1999 #sequence_revision 29-Oct-1999 #text_change 29-Oct-1999 
C; Access ion: T34418 
R;Fulton, B.; Wohldmann, P. 

submitted to the EMBL Data Library, July 1998 
A;Description: The sequence of C. elegans cosmid F12F3 . 
A; Reference number: Z21521 
A; Accession : T34418 

A; Status : preliminary; translated from GB/EMBL/DDBJ 
A /Molecule type: DNA 
A/Residues : 1-3488 <FUL> 

A; Cross-references : EMBL:U80022; PIDN : AAC25885 . 1 ; GSPDB : GN00023 ; CESP:F12F3.3 

A; Experimental source: strain Bristol N2 ; clone F12F3 

C;Genetics : 

A;Gene: CESP:F12F3.3 

A; Map position: 5 



A;Introns: 281/3; 332/1; 562/3; 600/3; 1866/3; 1944/3; 3393/1 

Query Match 5.1%; Score 299.5; DB 2; Length 3488; 

Best Local Similarity 20.6%; Pred. No. 0.0003; 

Matches 272; Conservative 187; Mismatches 475; Indels 387; Gaps 57; 



Qy 


1 


MEDLDQSPLV- -SSSDSPPRPQPAFKYQFVREPEDEEEEEEEEEEDEDEDLEELEV 

: : : : 1 1 : : 1 : : 1 1 : : : : 1 II: 1 1 : : 1 : : 1 


54 


Db 


340 


II I 1 I I 1 1 1 1 * 1 1 
VDEVDDSTVLEEKKDDGDDKSKPKTKKKI I KKKETPESEQVTAAEPEQQKISEVDVQSVA 


399 


Qy 


55 


LERKPAA GLSAAP VPT A PAAGA PLMD FGNDF 


85 


Db 


400 


-Ml ill: || || 

ETEVGAKKKPDAEKPTDLSKAKKDSKSKKSDEPEASTEEKSTTEKPTNDKTSKKSAEKKT 


459 


Qy 


86 


VPPAPR GPLPAAPPVAPERQPSWDPSPVSSTVPA PSPLSAAAV 


128 


Db 


460 


II 1 1 1 1 1 : = 1 1 h 1 1 1 : h : 
VKPKKEVTGKPLEAKKPVEDKKDASQPSSSKESSPPTDGKKKKQI PKALFI PDEISSRFG 


519 


Qy 


129 


SPSKL PEDDEPPARPP- -PPPPASVS PQAEPVWTPPA 




Db 


520 


| | : : 1 : | | | | | | : | | : : 
DPSTMHSETNITTTIRGREGSADAKTPLVEPLSASVSMKVFTLVESAKEKAEFSFKRRSE 


579 
164 
201 


Qy 


165 


A p AAPPSTPAAPK RRGSSGAWXXXXKIMD- LKEQPGN 


Db 


580 


1 1 l : : 1 : |: |:: | : | : : | | : 
TPDDKSRKKEGLPPAKKSEKKDEVTAEKQSTEALIESKKKEVDESKISEQQPSDKNKSEV 


639 


Qy 


202 


TI SAGQEDFPS VLLETAASXPSLS PLSAASFKEHEYLGMLSTVL PTEGTLQEN 

= 111 1 1 = 1 =111 : : : : 


254 


Db 


640 


VGVPEKAAGPETKKDV- -SEIEEVPKKKTIKKKTEKSDSSISQKSNVLKPADDDKSKSDD 


697 


Qy 


255 


VS EAS KE VS E KAKTLL I D RDLTEFSELE YSEMGSSFSVSP 


294 


Db 


698 


| : : | | : : | : | 1 I = I | : | I I : | 
VTDKSKKTTEDQTKVATDSKLEKAADTTKQI ETETVVDDKSKKKVLKKKTEKSDSF- ISQ 


756 


Qy 


295 


KAESAVI V- -ANPRE E 1 1 VKN KD E E E KL VSNN I LHXQQ EL P TALT KL VKE D 


343 


Db 


757 


KS ETP P WE PTKPAES EAQKI AE VNKAKKQKEVDDNL KREAE VAAKKI ADEKLKI EA 


813 


Qy 


344 


EWSSEKAKDSFNEKRVAVEAPMREEYADFKPFERVWEVKDSKEDSDMLAAG 


395 


Db 


814 


EANI KKTAEVEAAKKQKEKDEQLKLETEWSKKSAAEKLELEKQAQI KKAAEAD AVK 


870 


Qy 


396 


GKIESNLESKVD- -KKCFAD- -SLEQTNHEKDSESSNDDTSFPSTPEGIKDRSGAYITCA 


451 


Db 


871 


: 1 1 = : I : = 1 I I I | | : : | : j : | | | : : | 
KQKELNEKNKLEAAKKSAADKLKLEEESAAKSKKVSEESVKF GEEKKTKAGEKTVQV 


927 


Qy 


452 


PFNPAATES I ATNI FPLL SENXTDEKKI EEKKAQI VTEKNSBIDKTSNPFFVAAQDS 
1 = ==l 1 1 1 1 1 :||| =11 |||: h :| 1 


511 


Db 


928 


ESEPTSKKTIDTKDVGATE-PADE- -TPKKKI IKKK TEKSDSS ISQKS 


972 


Qy 


512 


ETDYVTTDNLTKVTEEWANMPEGLTPDLVQEACESELNEVTGTK IAYETKMDL 


565 


Db 


973 


II := : = l h 1 ■ 1 II :|: = 1 I || :|| : 

ATD SEKVSKQKEQDEPTKPAVSETQMVTEADKSKKQKETDEKLKLDAEIAAKTKQEA 


1029 


Qy 


566 


VQTSEV-MQESLYPAAQLCPSFEESEATPSPVLPD-IVMEAPLNSAVPSAGASVIQPSSS 


623 


Db 


1030 


■■ 1- II = ■■ hi | | : :|: : : || 

DEKSKLDAQEKI KKVS EDDAARKEKELNDKLKLESEI ATKKASADKLKLEEQAQ 


1083 



Qy 


624 


PLEASSVNYESIKHEPENPPPYE-EAMSVSLKVSGIKEEIKEPENINAALQETEAPYISI 


682 


Db 


1084 


:h 1 h 1 : 1 = = = 1 1 = 1 |:: = | | | : 
AKKAAEV- -EAAKKQKEKDEQLKLDTEAASKKAAAEKLELEKQAQIKKA AGADAVKK 


1138 


Qy 


683 


ACDLI KETKLSAE PAPDFS DYSEMAKVEQPVPDHSEL VE 


721 


Db 


1139 


: 1 - III : II =1 1 : = = | I 
QKELDEKNIOjE^KKSAAGKLKIEEESAAKSKQTVEEQAKLDAQTKAKTAEKQTKLEKDE 


1198 


Qy 


722 


DSSPDSEPVDLF- - SDDSI ' PDVP 


742 


Db 


1199 


h :|| : || || | 
KSTKESESKETVDEKPKKKVLKKKTEKSDSSISQKSETSKTWESAGPSESETQKVADAA 


1258 


Qy 


743 


QKQDET VMLVKESLTETSFESMI EYENKEKLSA LPP 


778 


Db 


1259 


HI II = hi 1 :| :| |:| | :| A | 
RKQKETDEKQKLEAE I TAKKSADE - - -KSKLEAESKLKKAAEVEAAKKQKEKDEQLKLDT 


1315 


Qy 


779 


EGGKPYLESFKLSLDNTKDTLLPDEVSTLSKK EKI PLQME 


818 


Db 


1316 


1 ■ II h II : h lh h : 

EAASKKAAAEKLELEKQSHIKKAAEVDAVKKQKELEEKQRLESEAATKKADAEKLKLEEQ 


1375 


Qy 


819 


ELSTAVYS NDDLFISKEAQIRETETFS DSSPIEIIDEFP 


857 


Db 


1376 


' 1 = h III hll 1 : :| | :|| | 
KKKAAEIALIEIQKEQEKLAQEQSRLEDEAKKSAEKQKLESETKSKQTEEAPKESVDEKP 


1435 


Qy 


858 


- - TLI SSKTDSFSKLAREYTDLEVSHKSE I ANAP -DGAGSLPCTELPHDLSLKNI QPKVE 
- M 1 =1 =1 lh 1 = 1 1 =| 1 :| | ||| 


914 


Db 


1436 


KKKVLKKKT EKSDSS I SQKSKSAKSTVDAAETL E S D FNLVE KKT VQKVE 


1484 


Qy 

Db 


915 
1485 


EKISFSDDFSKNGSATSKVLLLPPDVSALGHTQAEIESIVK 

| | | : : | Ml TRTSl()t 
QSPDESTSATIKRDPAQKTEEISKQDDGDEKKTTTDGKPPKPEDSEA TPKKRWK 


955 

1901 

1539 


Qy 


956 


p K VLEKEAEKKLPSDTEKEDRSPSAI FSADLGKTSWD 


993 


Db 


1540 


1 III HI 1 1 1 1 1 III I : 

KKTQKSDS VASDASLADVSKLSDDVEEKPKKKVLKKKTEKSDS VI SETSSVDTI KPESVE 


1599 


Qy 


994 


L 994 




Db 


1600 


I 1600 





RESULT 12 
S37431 

ankyrin 2, neuronal long splice form - human 

N;Alternate names: ankyrin B, 440K splice form; ankyrin-B; brain ankyrin; non- 
erythroid ankyrin 

N; Contains: ankyrin 2, short form 
C; Species: Homo sapiens (man) 

C;Date: 06-Jan-1995 #sequence_revision 06-Jan-1995 #text_change 13-Aug-1999 
C;Accession: S37431; A39643; B39643; A40334; A49462; S14533; S14569 
R ; Chan , W . 

submitted to the EMBL Data Library, September 1993 
A;Reference number: S37431 
A c SM4B,a:i] ,425Xt 

A; Status: preliminary 
A; Molecule type: mRNA 



A;Residues: 1-3924 <CHA> 

A; Cross -references: EMBL:Z26634; NID:g406287; PIDN : CAA81387 . 1 ; PID:g406288 
R;Otto, E . ; Kunimoto, M.; McLaughlin, T. ; Bennett, V, 
J. Cell Biol. 114, 241-253, 1991 

A; Title: Isolation and characterization of cDNAs encoding human brain ankyrins 

reveal a family of alternatively spliced genes. 

A; Reference number: A39643; MUID : 913 02466 ; PMID:1830053 

A; Access ion : A3 9643 

A; Status: preliminary 

A; Molecule type: mRNA 

A;Residues: 1-2077 <0T1> 

A; Cross -references : GB:X56 957 

A;Accession: B39643 

A; Status : prel iminary 

A;Molecule type: mRNA 

A/Residues : 1-1443,3585-3924 <OTT> 

A; Cross-references : EMBL:X56958 

R; Tse, W.T.; Menninger, J.C.; Yang-Feng, T.L. ; Francke, U. ; Sahr, K.E.; Lux, 
S.E.; Ward, D.C.; Forget, B.G. 
Genomics 10, 858-866, 1991 

A;Title: Isolation and chromosomal localization of a novel nonerythroid ankyrin 
gene. 

A;Reference number: A40334; MUID : 92009921 ; PMID: 1833308 
A /Access ion: A4 0334 
A; Molecule type: DNA 

A;Residues: 463 -474 , ' PE ' , 477 -4 95 <TSE> 

A; Cross-references: GB:M37123; NID : gl78647 ; PIDN : AAA62828 . 1 ; PID:gl78648 
R;Chan, W.; Kordeli, E.; Bennett, V. 
J. Cell Biol. 123, 1463-1473, 1993 

A; Title: 44 0-kD ankyrinB : structure of the major developmentally regulated 
domain and selective localization in unmyelinated axons. 
A/Reference number: A49462; MUID : 940754 09 ; PMID: 8253844 
Accession: A49462 

Status: preliminary; nucleic acid sequence not shown 
Molecule type: mRNA 
Residues: 1-3924 <RES> 

Cross-references: EMBL:Z26634; NID : g4 06287 ; PIDN : CAA8 1387 . 1 ; PID:g406288 
Genetics : 
Gene: GDB : ANK2 

Cross-references: GDB:127607; OMIM:106410 
Map position: 4q25-4q27 

Super family: ankyrin; ankyrin repeat homology 
Keywords: alternative splicing 

2-3924/Product : ankyrin 2, long form #status predicted <MAT> 

2-li -3924/Product : ankyrin 2, short form #status predicrheffl <MA2©n50Xm 

63-95/Domain: ankyrin repeat homology <AN01> 
96 -12 8 /Domain: ankyrin repeat homology <AN02> 



129-161/Domain 
162-190/Domain 
191-223/Domain 
232 -2 64 /Domain 
265-297/Domain 
298-330/Domain 
331-363/Domain 
364-396/Domain 
397-429/Domain 
430-462/Domain 



ankyrin repeat homology <AN03> 
ankyrin repeat homology <AN04> 
ankyrin repeat homology <AN05> 
ankyrin repeat homology <AN06> 
ankyrin repeat homology <AN07> 
ankyrin repeat homology <AN08> 
ankyrin repeat homology <AN09> 
ankyrin repeat homology <AN10> 
ankyrin repeat homology <AN11> 
ankyrin repeat homology <AN12> 



F;463-495/Domain: ankyrin repeat homology <AN13> 

F; 496-528/ Doma in : ankyrin repeat homology <AN14> 

F; 52 9-561/Domain : ankyrin repeat homology <AN15> 

F; 562 -5 94 /Doma in : ankyrin repeat homology <AN16> 

F; 595-627/ Doma in : ankyrin repeat homology <AN17> 

F; 628 -66 0/Domain : ankyrin repeat homology <AN18> 

F; 661-693/ Doma in : ankyrin repeat homology <AN19> 

F; 694 -7 26 /Domain : ankyrin repeat homology <AN2 0> 

F; 727 -75 9/Domain : ankyrin repeat homology <AN21> 

F; 76 0-792/Domain : ankyrin repeat homology <AN22> 

F;793-825/Domain: ankyrin repeat homology <AN23> 

Query Match 5.0%; Score 299; DB 2; Length 3 924; 

Best Local Similarity 21.5%; Pred. No. 0.00037; 

Matches 258; Conservative 167; Mismatches 417; Indels 358; Gaps 62; 

Qy 14 DSPP RPQPAFKYQFVREPEDEEEEEEEEEEDEDEDLE ELEVLERK 58 

I II :| I | :::::::: | | : | | | | 

Db 1648 DI PPDETQSTQKQHKPSLGI KKPVRRKLKEKQKQKEEGLQASAEKAELKKGSSEESLGED 1707 

Qy 59 PAAGLSAAPVPTAPAAGAPLMD FGNDFVPPAPRG 92 

I Ih 1 11 I :|h: I :| :| 

Db 1708 P--GLAPEPLPTVKAT-SPLIEETPIGSIKDKVKALQKRVEDEQKGRSKLPIRVKGKEDV 1764 

Qy 93 PLPAA- PPVAPERQPSWDPSP VSSTVPAPSPL 123 

III : II III II Ih 

Db 1765 PKKTTHRPHPAASPSLKSERHAPGSPSPKTERHSTLSSSAKTERHPPVSPSSKTEKHSPV 1824 

Qy 124 SAAA VSPSKLPEDDEPPA RPPPPPPASVSPQAEPVW 15 9 

I = 1 I I I I I I = I II h : : II 

Db 1825 SPSAKTERHSPASSSSKTEKHSPVSPSTKTERHSPVSSTKTERHPPVSPSGKTDKRPPV- 1883 

Qy 160 TPPAPAPAAPPSTPAAPKRR GSSGAWXXXXKIMDLKEQPGNTI SAGQEDFPSVLLE 216 

: I II : I = = l II I II =1 - I 

Db 1884 SPSGRTEKHPPVSPGRTEKRLPVSPSGRTD KHQPVSTAGKTEKHLPVSPSG 1934 

Qy 217 TAASXPSLSPLSAAS-FKEHEYLGNLSTVLPT EGTLQENVS EAS KE VS EKAKT 268 

I HI I =| : I : : I h I h II I 

Db 1935 KTEKQP P VS PTS KTER I EETMS VRELMKAFQSGQDPS KHKTGLFEHKSAKQKQPQEKGKV 1994 

Qy 269 LLIDRDLTEFSELEYSEMGSSFSVSPKAES- -AVIVANPREEI I VKNKDE- - 316 

:\ \: : :\ : : \ |: ||| | |:: | || :| 

Db 1995 R VE KE KG P I LTQRE - AQKTENQT I KRGQRLP VTGTAES KRG VRVS S IGVKKEDAAG 2049 

Qy 317 -EEKLVSNNI LHXQQELPTALTKLVKE DEW 346 

HI-!: I I = 1 = 1 I : I Ih 

Db 2050 GKEKVLSHKIPEPVQSVPEEESHRESEVPK--EKMADEQGDMDLQISPDRKTSTDFSEVI 2107 

Qy 7 SSE-KAKDSFNEKRVAVEAPMREEYAD FKPFERVWEVKDSKED {0--- 388 

Db 2108 KQELEDNDKYQQFRLSEETEKAQLHLDQVljTSPFNTTFPLDYMKDEFLPALSLQSGALDG 2167 

Qy 38 9 -SDMLAAGGKIESNLESKVDKKCFADSLEQTNHE KDSESSNDDTSFPSTPEGI KDRS 444 

1= I I I I - II II = | = | : II :|: 

Db 2168 SSESLKNEGVAGS PCGSLMEGTPQI SSEES YKHEGLAETPETSPESLSF - - S PKKSEEQT 222 5 

Qy 44 5 GAYITCAPFNPAATESIATNIFPLLEDPTSENXTDEKKIEEKKAQIVTEKNTSTKTSNPF 504 



Db 


2226 


G ETKESTKTETTTEI RSEKEHPTTKDI TGGS - - EERGATVTEDSETSTESFQKE 


2277 


Qy 


505 


FVAAQDSETDYVTTDNLTKVTEEWA-NMPEGLTPDLVQEACESELNEVTGTKIAYETKM 


563 


Db 


2278 


M hi =11 III :|| 1 |= h = l 

ATLGS PKDTS PKRQDDCTGS CS VALAKETPTGLT EEAACDEGQRTFGSS-AHKT- - 


2330 


Qy 


564 


DLVQTSEVMQESLYPAAQLCPSFEESEATPSPVL -PDIVMEAP LNSAV 


610 


Db 


2331 


II III = :|-| II | |: :| | |: 
QTDSEAQES TATSDETKALPLPEASVKTDTGTESKPQGVIRSPQGLELAL 


2380 


Qy 


611 


PSAGASVIQ PSSSPLEASSVNYESI KHE-PEN- - PPPYEEA-MSVSLKVSG 


657 


Db 


2381 


II ■ h MM 1 =: h h: 1 1 Ih I 

PSRDSEVLSAVADDSLAVSHKDSLEASPVLEDNSSHKTPDSLEPSPLKESPCRDSLESSP 


2440 


Qy 


658 


I KEE I KE PEN - - 1 NAALQETE - - AP YI S I ACDLI KETKLSAE PAPDFSDYS EMAKVE 


710 


Db 


2441 


- ::| 1 = = Ih Ml h h- II 1 M M 

VEPKMKAGIFPSHFPLPAAVAKTELLTEVASVRSRLLRD PDGS- -AEDDSLE 


2490 


Qy 


711 


QPVPDHSELVEDS SPDSEPVDLFSDDSI PDVPQKQDETVMLVKESLTETSFESMI 


765 


Db 


2491 


1 = hi 1 llh = 1 = 1 -II 

Q TSLMESSGKSPLSPDTPSSEEVSYEVTPKTTDVSTPKPAVIHECAEED 


2539 


Qy 


766 


EYENKEKLSALPPEGGKPYLESFKLSLDNTKDTLLPDEVSTLSKK EKI PLQMEELS 


821 


Db 


2540 


■WW II 1 Ih II : Ih : 1 : MINI 

DSENGEKKRFTPEE EMFKMV TKI KMF-DELEQEAKQKRDYKKEPKQEESSS 


2589 


Qy 


822 


TAVYSNDDLF I S KEA - QI RETETFSDSS P I E 1 1 DEFPTLI SS KTDS FS KLARE YTDLEVS 


880 


Db 


2590 


: |: | | : : : | : | | : | |::|:: | : 

S SDPDADCSVDVDEPKHTGSGEDESGV PVLVTSESRKVSSSSES 


2633 


Qy 


881 


HKSEIANAPDGAGS LPCTELPHDLSLKN IQPKVEEKISF- -SDD 


922 


Db 


2634 


■ hi III 1 : II : : | | | : : : | = M 

-EPELAQLKKGADSGLLPEPVIRVQPPSPLPSSMDSNSSPEEVQFQPWSKQYTFKMNED 


2692 


Qy 


923 


FSKNGSATSKVLLLPPDVSALGHTQAEI ESI VKPKVLEKEAEKKLPSD TEKEDRS 


977 


Db 


2693 


■ ' lh = l 1 1 II MM 
TQEEPGKSEE EKDSESHLAEDRHAVSTEAEDRS 


2725 



RESULT 13 
T16251 

hypothetical protein F35A5.1 - Caenorhabditis elegans 
C; Species: Caenorhabditis elegans 

C;Date: 20-Sep-1999 #sequence_revision 20-Sep-1999 #text_change 18-Feb-2000 
C; Access ion: T162 51 
R;Leimbach, D. 

submitted to the EMBL Data Library, January 19 96 

A; Description: The sequence of C. elegans cosmid F35A5. 

A; Reference number: Z18485 

A /Access ion : T16251 

A; Status: preliminary; translated from GB/EMBL/DDBJ 
A /Molecule type: DNA 
A/Residues : 1-1274 <LEI> 

A; Cross-references : EMBL :U4 6675; NID :gll 66613 ; PID : gl 16 6621 ; PIDN : AAB52641 . 1; 
GSPDB:GN00028; CESP:F35A5.1 

A; Experimental source: strain Bristol N2 ; clone F35A5 



C;Genetics : 
A; Gene: CESP:F3 5A5.1 
A ; Map position: X 
A;Introns: 1272/2 

Query Match 4.9%; Score 290.5; DB 2; Length 1274; 

Best Local Similarity 20.5%; Pred. No. 0.00017; 

Matches 221; Conservative 126; Mismatches 417; Indels 313; Gaps 45; 

Qy 13 SDSPPRP- -QPAFKYQFVREPEDEEEEEEEEEEDEDEDLEELEVLERKPAAGLSAAPVPT 70 

I : I I I II h = I Mill I Ih I : I I ^ || 

Db 2 SRAPPTPIKNPAKKWKPPWESVDEEEEME VDEETPAPSKLEKKPSLKRKDAPTKP 56 

Qy 71 APAAGAPLMDFGNDFVPPAPRGPLPAAPPVAPERQPSWDPSPVSSTVPAPSPLSAAAVSP 13 0 

1 = III hi II : | | |: || | 
Db 57 VPSPGAP SPVPIKNPVKKWKAPWEDDEPMEEAPAAP VP 94 

Qy 131 SKLPEDDEP PARPPPPPPASVSPQAEPVWTPPAPAPAAPPSTPAAPKRRGSSGAWX 187 

= 1 I I Ihl I = I II II II 

Db 95 AKKVRDPSPKKVPAKPRDASPKKIMAAKK EPETLPAVPP-TPVKNPVKKFKAPWED 149 

Qy 18 8 XXXKI MDLKEQPGNTI SAGQEDFPS VLLETAASXP - SLS PLSAASFKEHEYLGNLSTVLP 24 6 

= 1 = 1 = I l : I : Ihl II II Mh : I | 

Db 150 DEVDVEDVKDAP - - TVPAKKTPVLKKKEPAAAAKPRDPS PKKAAPSKEHDP I VPP 202 

Qy 247 TEGTLQENVS EAS KE VSEKAKTLLI DRDL - - TEFS ELE YSEMGSSFSVSPKA 296 

I I * : I I I - I I I : ||||: 

Db 203 T P I KNPAKKWKP PWE DDE VP TEE I KE PEPATRKVPALKKKEPSTS VKP VS 252 

Qy 297 ESAVI VANPREEI I VKNKDE EEKLVSNNI LHXQQELPTALTKL 339 

Db 253 D PSPTKKVPVKKEPEVPPTPIKNPTKKWKPPWEDETPVEEV- -KEPPVPEKKAPV 305 

Qy 34 0 VKEDEWSSEKAKDSFNEKR VAVEAPMREEYADFKPFERVWEVKDSKEDS 38 9 

: I : Ihl I I I- HI III I : 

Db 3 06 L KKKD PA P AAKARD P S P S KAA P KKVE P S S PWP PT P VKN P VKKY KP PWEVDDEPAE- 361 

Qy 390 DMLAAGGKIESNLESKVDKKCFADSLEQTNHEKDSESSNDDTSFPSTPEGIKDRSGAYIT 449 

I I =11 I- I I : : I I =| I : 

Db 362 EVKKPSAPEKK- -TPVLKRKEPEPSSTTPSSDPSPKKAAPAVKPRDSSPKK 410 

Qy 450 CAPF- -NPAATESIATNIFPLLEDPTSE NXTDEKKI EE - KKAQI VTEKNTSTKTS 501 

1=1111= =H = II :|| 1= = =1 I 

Db 411 ATPLQADPKAQEVPPTPV KNPVKKYKPPWEVDDEDPVEEVKQPEAPAKKTPVLKRK 466 

Qy 502 N P FF VAAQDS ETD Y VTTDNLTKVTE E WANM PE GLTPDLVQ 542 

I Ihl I III II III 

Db 467 EP AAKD TAKPATSKTPET PEKKDPVKPRDSS PKKVAAKPDSAQAPAT 513 

Qy 543 EACESELNEVTGTKIAYETKMDLVQTSEVMQESLYPAAQLCPSFEES 58 9 

I h -h I H : :||| I | 

Db 514 PVKNPVKKWRPPWEDDETPADDVSKPTDAKKTPSLAKKDPAPAKESLKPKADTKAPAKPR 573 

Qy 590 EATPSPVLPDIVMEAPLNSAVPSAGASVIQPSSSPLEASSVNYESIKHEPENPPPYEEAM 649 

= =1 M | h II I I : I : I I : 

Db 574 DPSPKKVAP TAPEKKTPVLAKKEPAGPADSKTKEPEKSKPRDPSPKKAVP 623 



Qy 650 S VSLKVSGI K- - EEI KEPENINAALQETEAPYI SI ACDLI KETKLSAEPAPDFSD 702 

: : :| | | :|:: | :| : | | 

Db 624 AKP VPKTE VAPAAVKKP E P I S KPKDTAP KKAE PNS PW PPTP 665 



Qy 7 03 YSEMAKVEQPVPDHSELVEDSSPDSEPVDLFSDDSIPDVPQKQDETVMLVKESLTETSFE 762 

h II II =:|| I h : :| :| |:= |= | 

Db 666 VKNPVKKWKPPWEDDDAPAKPVSL PEPEKKTPVLAKKAPTKPDSE 710 

Qy 763 SMI EYENKEKLSALPPEGGKPYLESFKLSLDNTKDTLLPDEVSTLSKKEKI PLQMEELST 822 

= 1 I I 

Db 711 AAADPVSGP 719 

Qy 823 AVYSNDDLFISKEAQIRETETFSDSSPIEI I DEFPTLI SSKTDSFSKLAREYT 875 

h I -hi - I II:: : | | : | : :| 
Db 720 SSKDPKLAKKAPVKP RDPS PMKAVP I KPAPKTE VP PAWKKPEPVAK 766 

Qy 876 DLEVSHKSEIANAPDGAGSLPCTELPHDLSLKNIQPKVE EKI SFSDDFSKNGSA 929 

: I I I I : :| I I : I •] I I - : I 

Db 767 SRDPSPKK- -AKAEPNSPWPPT- -PVKNPVKKWKPPWEDDDAPAEPVNVPEPEKKTPVL 822 

Qy 930 TSKVLLLPPDVS ALGHTQAEIESIVKPKVLEKEAEKKLPSDTEKEDRSP 978 

I = I I I II I = 11= = I I II : I II 

Db 823 AKKTPVKPRDPSPKKAVPAKPSTKTDAPPVSVKKPEPVSKPKE PSPKKAEPNSP 876 



RESULT 14 
A56577 

microtubule-associated protein MAP IB - rat (fragment) 
C; Species: Rattus norvegicus (Norway rat) 

C;Date: 21-Jul-1995 #sequence_revision 21-Jul-1995 #text_change 16-Feb-1997 
C; Access ion: A56577 

R;Zauner, w. ; Kratz, J. ; Staunton, J. ; Feick, P.; Wiche, G. 
Eur. J. Cell Biol. 57, 66-74, 1992 

A; Title: Identification of two distinct microtubule binding domains on 
recombinant rat MAP IB. 

A;Reference number: A56577; MUID : 92347374 ; PMID: 1639092 

A /Access ion : A56577 

A; Status: preliminary 

A; Molecule type: mRNA 

A; Residues: 1-2364 <2AU> 

A; Cross-references : GB :X6 055 0 

A ; Experimental source : bra in 

A;Note: nucleotide sequence not given; conceptual translation not complete 
C; Superfamily : microtubule-associated protein MAP1B 

Query Match 4.9%; Score 28 9; DB 2; Length 2364; 

Best Local Similarity 20.9%; Pred. No. 0.00047; 

Matches 275; Conservative 158; Mismatches 462; Indels 422; Gaps 61; 

Qy 30 EPEDEEEEEEEEEEDEDEDLEELEVLERKPAAGLSAAPVPTAPAAGAPLMDFGNDFVPPA 89 

I I III M :: || :: | | | | || : ||: 
Db 909 EAEQSEEEGEEEEDKAEDAREEDHEPDKTEAEDYVTVIAVVDKAAEAGVTEDQY--DFL 963 

Qy 90 PRGSEiBXAPP- -VABERJ&PSWDPSPVSSTVPAPSPLSAAAVSPSKLPEDDEPPARPP *p975X9 

II I = 1 hh I hi I II hi I 

Db 964 GTPAKQPGVQS PSREPA- - SSI HDETLPGGSESEATA SDEENREDQPEEF 1011 



Qy 148 PASVSPQAEPVWTPPAPAPAAPPSTPAAPKRRGSSGAWXXXXKIMDLKEQPGNTISAGQ 207 

1 = hi =| : | | | 

Db 1012 TATSGYTQSTIEISSEPTPMDEMSTP RDVMTDETNNEETESPSQ 1055 

Qy 208 E DFPSVLLETAASXP SLSPLSAASFKE HEYLGNLSTVLPTEGTLQENV 255 

I =11 I I I = II I : : I 11= I 

Db 1056 EFVNITKYESSLYSQEYSKPWASFNGLSDGSKTDATDGRDYNASASTISPPSSMEEDKF 1115 

Qy 2 56 SEAS KEVSEK AKTLLI DRDLTE 277 

h I h : || | =1 : 

Db 1116 S KSALRDAYRPEETDVKTGAELD I KDVSDERLSPAKS PSLSPSPPSPI EKTPLGERS V- N 1174 

Qy 278 FS ELEYSEMGSSFS-VSPKAESAVI VANPRE EIIVKNK 314 

II h = I I = = Ml II = 1=1 I l = = = = 

Db 1175 FSLTPNE I KASAEGEATAWS PGVTQAWEEHCAS PEEKTLE WS PSQSVTGSAGHTPYY 1234 

Qy 315 DEEEKLVSNNILHXQQELPTALTKLVKEDEWSSE--KAKDSFNEKRVAVEAPMRE 368 

lh : : I :| II I :||| Ih : :|| | 

Db 1235 QSPTDEKSSHLPTEVTENAQAVP VSFEFTEAKDE-NER- -SSISPMDE 1279 

Qy 369 EYADFK- PFERVWE -VKDSKED SDMLAAGGKI ESNLESKVDKKCFADSL 415 

I = I 1 = 1 = = II : I 11 = 11 M I = hi 

Db 1280 PVPDSESPIEKVLSPLRSPPLIGSESAYEDFLSADDKALGRRSESPFEGKNGKQGFSD- - 1337 

Qy 416 EQTNHEKDSESSNDDTSFPSTPEGIKDRSGAYITCAPFNP AATESIATNI FPLLED- 471 

1 = 1 I I II : : hi |:| h I - II 

Db 1338 KESPVS-DLTSDLYQDKQEEKRAGFI PIKEDFSPEKKASDAEIMSSQSALALDE 13 90 

Qy 4 72 PTSENXTDEKKIEEKKAQIVTEKNTSTKTSNPFFVAAQDSETDYVTTDNLTK 523 

M ■ ^ : I ==l I h= I 1 = 1= f 
Db 1391 RKLGGDGSPTQVDVSQFGSFKEDTKMSISEGTVSDKSATPVDEGAEDT YSHMEGVAS 1447 

Qy 524 VTEEWA- -NMPEGLTPDLVQEACESELNEVTGTKIAYETKMDLVQTSEVMQES-LYPAA 580 

h II = II I I I = =h h= = :||| ||= = 1 = 

Db 1448 VSTASVATSSFPEPTTDD-VSPSLP1AEVGSPHSTEVDDSLSVSWQTPTTFQETEMSPSK 1506 

Qy 581 QLCP SFEESEATPSPVLPDIVMEAPLNSAV P 611 

Ml :: : I : : |:| :| | 

Db 1507 EECPRPMSISPPDFSPKTAKSRTPVQDHRSEQSSMSIEFGQESPEHSLAMDFSRQSPDHP 1566 

Qy 612 SAGASVIQ PSSSPLEASSVNYESIKHE- -PENPPPY EEAMSVS-LKVSG 657 

: II - h I = h h I I I I :||| - | 

Db 1567 TVGAGMLHITENGPTEVDYSPSDIQDSSLSHKIPPTEEPSYTQDNDLSELISVSQVEASP 1626 

Qy 658 I KEEI KEPENINAALQETE APYI S I ACDLI KE TKLSAEP 696 

I I : Ml = | h| = == Ml : 

Db 1627 STSSAHTPSQIASPLQEDTLSDWPPRDMSLYASLASEKVQSLEGEKLSPKSDISPLTPR 1686 

Qy 697 APDFSDYSEMAK VEQPVPDHSELVEDS 723 

= 1 III =11 =11 == h 

Db 1687 ESSPTYSPGFSDSTSGAKESTAAYQTSSSPPIDAAAAEPYGFRSSMLFDTMQHHLALSRD 1746 

Qy 724 SPDSEPVDLFSDD SIPDVP QKQD 746 

llllll: II =1 : 

Db 1747 LTTSSVEKDNGGKTPGDFNYAYQKPESTTESPDEEDYDYESHEKTIQAHDVGGYYYEKTE 18 06 

Qy 74 7 ETVML VKESLTETSFESMIEYENKEKLSALPPEGGKPYLESFKLSLDNTKD 797 



Db 


1807 


RTIKSPCDSGYSYETIEKTTKTPEDGGYSCEITEKTTRTPEEGGYSY EISEK 


1858 


Qy 


798 


TLLPDEVS - -TLSKKEKI PLQMEELSTAVYSNDD - - LFISKEA 


836 


Db 


1859 


1 III 1 1 h --I :| | |: 
TTRTPEVSGYTYEKTERSRRLLDDI SNGYDDTEDGGHTLGDCS YSYETTEKI TSFPESES 


1918 


Qy 


837 


QIRETETFSDSSP I EI IDEFPTLISSKTDSFSKLAREYT . DL 


877 


Db 


1919 


II 1 = II 1 = 1 h 1 II II 
YSYETTTKTTRSPDTSAYCYETMEKITKTPQASTYSYETSDRCYTPERKSPSEARQDVDL 


1978 


Qy 


878 


EVSH - KSE I ANA PDG AGSLPCTELPHDLSLKNIQP KV 


913 


Do 


1979 


1 1 hi- : h II 1 1 h 1 
CLVSSCEFKHPKTELSPSFINPNPLEWFAGEEPTEESERPLTQSGGAPPPSGGKQ0^QC 


2 


Qy 


914 


EEKISFSDDFSKNGSATSKVLLLPPDVSALGHTQAEIESIVKPKVLEKEAEKK-LPSD-- 


970 


Db 


2039 


DETPPTSVSESAPSQTDSDV PPETE ECPSITADANLDSEDESETI PTDKT 


2088 


Qy 


971 


TEKEDRSPS A I FSADLGKTSWDLLYWRDI KKTG 1004 


Db 


2089 


HI Mi :||| || | | | 
VTYKHMDPPPAPMQDRSPSPRHPDVSMVDPEALAI EQNLGKALKKDLKEKAKTKKPG 214 5 



RESULT 15 
QRMSP1 

microtubule-associated protein MAP1B - mouse 

N; Alternate names: microtubule-associated protein MAPI (X) ; microtubule- 
associated protein MAPI. 2; microtubule-associated protein MAP5 
C; Species: Mus musculus (house mouse) 

C;Date: 30-Sep-1991 #sequence_revision 30-Sep-1991 #text_change 01-Sep-2000 
C;Accession: S07549; S44387; A33645 
R/Noble, M . ; Lewis, S.A.; Cowan, N.J. 
J. Cell Biol. 109, 3367-3376, 1989 

A; Title: The microtubule binding domain of microtubule-associated protein MAP1B 

contains a repeated sequence motif unrelated to that of MAP2 and tau. 

A; Reference number: A33645; MUID : 90094539 ; PMID:2480963 

A /Access ion: S0754 9 

A ,-Molecule type: mRNA 

A/Residues: 1-2464 <N0B> 

A; Cross-references: EMBL:X51396/ NID:g52999; PIDN : CAA35761 . 1 / PID:g53000 
R; Sanchez, C. / Padilla, R. / Paciucci, R. ; Zabala, J.C.; Avila, J. 
Arch. Biochem. Biophys. 310, 428-432, 1994 

A/Title: Binding of heat-shock protein 70 (hsp70) to tubulin. 

A/Reference number: S44387; MUID: 94234720 ; PMID:8179328 

A /Access ion: S443 8 7 

A; Status: preliminary 

A; Molecule type: protein 

A/Residues: 653-663,'IC <SAN> 

C; Super family: microtubule-associated protein MAP1B 

C; Keywords: microtubule binding,- phosphoprotein; tandem repeat 

F / 58 9 -78 6 /Domain : microtubule binding #status experimental <MTB> ' 

F; 589-592, 639-642, 649-652, 655-658 ,660 -663 ,668 -671 ,674 -677, 679-682 ,683 -686, 687- 

690,691-694,695-698, 699-702 ,708 -711, 712 -715, 716-719, 72 0-723, 727-730, 758 -761, 764- 
767, 783-786/Region: 4-residue repeats (K/R-K-E/D-X) 
F; 18 61 -2 064 /Region : 17-residue repeats 



F; 91, 116,351, 888,1124, 1153, 1168, 1208, 1662, 1877, 1918, 2003, 2030, 2054, 2083/Binding 
site: phosphate (Ser) (covalent) #status predicted 

F; 147, 96 9, 133 6, 1562, 1563, 1702, 1708, 1990, 2057, 2 063, 24 19/Binding site: phosphate 
(Thr) (covalent) #status predicted 

F; 1953 /Binding site: phosphate (Tyr) (covalent) #status predicted 

Query Match 4.9%; Score 289; DB 1; Length 2464; 

Best Local Similarity 19.8%; Pred. No. 0.0005; 

Matches 262; Conservative 148; Mismatches 474; Indels 442; Gaps 55; 



Qy 


32 


EDEEEEEEEEEEDEDEDL - EELEVLERKPAAGLSAAPVPTAPAAGAPLMDFGNDFVPPAP 


90 


Db 


1009 


i | - |j| 1 1 1 1 : | ! | | : : | | | | : | 
EAEQSEEEGEEEDKAEDAREEGYEPDKTEAEDYVMAVADKAAEAGVTEEQYGY - - - 


1061 


Qy 


91 


RGPLPAAPPV-APERQPSWDPSPVSSTVPAPSPLSAAAVSPSKLPEDDEPPARPPPPPPA 


149 


Db 


1062 


1 | : : | h | : | | : | | | | | : | | | 
LGTSAKQPGI QSPSREPA- -SSI HDETLPGGSESEATA SDEENREDQPEEFTA 


1112 


Qy 


150 


SVSPQAEPVWTPPAPAPAAPPSTPAAPKRRGSSGAWXXXXKIMDLKEQPGNTISAGQED 


209 


Db 


1113 


TSGYTQSTI EI SSEPTPMDEMSTP RDVMSDETNNEETESPSQEF 


1156 


Qy 


210 


FPSVLLETAASXPSLSPLSAASFK EHEYLGNLSTVLPTEGTLQENVSE 


257 


Db 


1157 


h = 1 : 1 1 1 : | : | | : | : : | : 
VNITKYESSLYSQEYSKPAVASFNGLSEGSKTDATDGKDYNASASTISPPSSMEEDKFSK 


1216 


Qy 


258 


AS KEVSEK AKTLLIDRDLTEFS 


279 


Db 


1217 


Ml- | | ; : | : || 

SALRDAYCSEEKELKASAELDIKDVSDERLSPAKSPSLSPSPPSPIEKTPLGERSV-NFS 


1275 


Qy 


280 


ELEYSEMGSSFSVSPKAESAVI VANPRE EI I VKNK 


314 


Db 


1276 


1 - I 1 : 1 1 1 1 1 1 = 1 : 1 1 1 = = 
■ ii i i i i ii iii i 

LTPNEIKVSAEGEARSVSPGVTQAWEEHCASPEEKTLEWSPSQSVTGSAGHTPYYQSP 


1335 


Qy 


315 


-DEEEKLVSNNILHXQQELPTALTKLVKEDEWSSEKAKDSFNEKRVAVEAPMREEYADF 


373 


Db 


1336 


M = 1 : 1 : = | | | | | : | : | | | | 
TDEKSSHLPTEVSENAQAVPVSF EFSEAKDE-NER- -ASLSPMDEPVPDS 


1382 


Qy 


374 


K-PFERVWE VKDSKEDSDMLAAGGKIESNLESKVDKKCFADSLEQTNHEKDSE 

iii tiii ii 


425 


Db 


1383 


1 1=1 = 1 h 1 h II 

ES PVEKVLSPLRS PPLLGSES PYEDFLSADSKVLGR RSESPFE 


1425 


Qy 


426 


SSNDDTSFPSTPEGIKDRSGAYITCAPFNPAATESIATNIFPLLEDPTSENXT 


478 


Db 


1426 


III : 1 H -II. 

GKNGKQGFPDRESPVSD LTSTGLYQDKQEEKSTGFI PI KEDFGPEKKTSDVETMS 


1480 


Qy 


479 


DEKKI EEKKAQ I VTEKNTSTKTSNPFFVAAQDS ETD 

Ihl: :| -1 11-1 | :| 


514 


Db 


1481 


SQSALALDERKLGGDVSPTQI DVSQFGSFKEDTKMS I SEGTVSDKSATP - - VDEGVAEDT 


1538 


Qy 


515 


YVTTDNLTKVTEEWA- -NMPEGLTPDLVQEACESELNEVTGTKIAYETKMDLVQTSEVM 
1 = = h II : II 1 1 1 : :h | = : : :||| 


572 


Db 


1539 


YSHMEGVASVSTASVATSSFPEPTTDD-VSPSLHAEVGSPHSTEVDDSLSVSWQTPTTF 


1597 


Qy 


573 


QES - LYPAAQLCP SFEESEATPSPVLPDI VMEAPLNSAV- - 


610 


Db 


1598 


11= : h : || :: : | : : |:| :| 
QETEMSPSKEECPRPMSISPPDFSPKTAKSRTPVQDHRSEQSSMSIEFGQESPEHSFAMD 


1657 



Qy 611 PSAGASVIQ PSSSPLEASSVNYESIKHE- -PENPPPY EEAM 649 

h ||||: |: hi- I II I : 

Db 1658 FSRQSPDHPTLGASVLHITENGPTEVDYSPCDIQDSSLSHKIPPTEEPSYTQDNDLSELI 1717 

QY 650 SVS-LKVSGIKEEIKEPENINAALQETE APYISIACDLIKE TKLSAE 695 

Ml - I I I : III : | |:| : :: ||| : 

Db 1718 SVSQVEASPSTSSAHTPSQIASPLQEDTLSDWPPREMSLYASLASEKVQSLEGEKLSPK 1777 

Qy 696 P APDFSDYSEMAK VEQPVPDHSELVEDS- 723 

M Ml • II Ml - h 

Db 1778 SD I S PLTPRESS PLYS PGFSDSTSAAKETAAAHQASSS P P I DAATAE PYGFRSSMLFDTM 1837 

Qy 724 SPDSEPVDLFSD DS 737 

III I I I I 

Db 1838 QHHLALNRDLTTSSVEKDSGGKTPGDFNYAYQKPENAAGSPDEEDYDYESQEKTIRTHDV 1897 

Qy 73 8 I PDVPQKQDETVML VKESLTETSFESMI EYENKEKLSALPPEGGKPYLESF 78 8 

Db 1898 VRYYYEKTERTIKSPCDSGYSYETIEKTTKTPEDGGYTCEITEKTTRTPEEGGYSY 1953 

Qy 789 KLSLDNTKDTLLPDEVS - -TLSKKEKI PLQMEELSTAVYSNDD 829 

: - I III I I h --I M 

Db 1954 EI SEKTTRTPEVSGYTYEKTERSRRLLDDI SNGYDDTEDGGHTLGDCSYSYETTEK 2009 

Qy 830 - -LFISKEAQIRETETFSDSSP 1 EI IDEFPTLISSKTDSFSKLAREYT 875 

I h II I : II I : | |: | || 

Db 2010 ITSFPESESYSYETSTKTTRSPDTSAYCYETMEKITKTPQASTYSYETSDRCYTTEKKSP 2069 

Qy 876 DL EVSH-KSEIANA PDG AGSLPCTELPHDLSLKNI QP 911 

II II I •• I : : : h II I I h I 

Db 2070 SEARQDVDLCLVSSCEFKHPKTELSPSFINPNPLEWFAGEEPTEESEKPLTQSGGAPPPS 2129 

Qy 912 KVEEKISFSDDFSKNGSATSKVLLLPPDVSALGHTQAEIESI VKPKVLEKEAE 964 

Db 2130 GGKQQGRQCDETPPTSVSESAPSQTDSDV PPETE ECPSITADANIDSEDE 2179 

QY 965 KK-LPSD TEKEDRSPS A I FSADLGKTS WDLLYWR 998 

■ = 1 = 1 :||||| :||| || 

Db 218 0 SET I PTDKTVTYKHMDPPPAPMQDRSPSPRHPDVSMVDPDALAVDQNLGKAVKKDLKEKT 2239 

Qy 999 DIKKTG 1004 

II I 

Db 2240 KTKKPG 2245 



Search completed: January 22, 2004, 16:32:56 
Job time : 45.1516 sees 



GenCore version 5.1.6 
Copyright (c) 1993 - 2004 Compugen Ltd. 



OM protein - protein search, using sw model 



Run on : 



Title: 

Perfect score: 
Sequence : 

Scoring table: 



January 22, 2004, 16:31:15 ; Search time 61.624 Seconds 

(without alignments) 
4932.919 Million cell updates/sec 

US-09-830-972-29 
5923 

1 MEDLDQSPLVSSSDSPPRPQ VKDAMAKIQAKI PGLKRKAE 1178 

BLOSUM62 

Gapop 10.0 , Gapext 0.5 



Searched: 



830525 seqs, 258052604 residues 



Total number of hits satisfying chosen parameters: 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 



830525 



Database 



SPTREMBL__23 : * 

1 : sp_archea : * 

2: sp_bacteria : * 

3 : sp_f ungi : * 

4 : sp_human : * 

5: sp__invertebrate : * 

6: sp_mammal:* 

7 : sp_mhc : * 

8: sp_organelle : * 

9 : sp_phage : * 
10: sp_plant:* 
11: sp_rodent : * 
12: sp_virus : *. 
13 : sp_vertebrate : * 
14 : sp_unclassif ied: * 
15: sp_rvirus : * 
16: sp_bacteriap : * 
17: sp__archeap : * 



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 

SUMMARIES 



Result Query 

No. Score Match Length DB ID 



Description 



1 


4851 


81 


. 9 


986 


4 


Q8IUA4 


Q8iua4 homo sapien 


2 


428 9 


72 


. 4 


1162 


11 


Q8BGM9 


Q8bgm9 mus musculu 


3 


4277.5 


72 


. 2 


1163 


11 


Q8K3G8 


Q8k3g8 mus musculu 


4 


3658 . 5 


61 


. 8 


1046 


11 


Q8BGK7 


Q8bgk7 mus musculu 


5 


2527 


42 


. 7 


639 


11 


Q8K290 


Q8k2 90 mus musculu 


6 


1531 


25 


. 8 


392 


4 


Q96B16 


Q96bl6 homo sapien 


7 


1201 . 5 


20 


. 3 


375 


11 


Q8BHF5 


Q8bhf5 mus musculu 


8 


1163 


19 


. 6 


356 


11 


Q8BH78 


Q8bh78 mus musculu 


9 


1152 . 5 


19 


. 5 


357 


11 


Q8K3G7 


Q8k3g7 mus musculu 


10 


867 


14 


. 6 


179 


6 


Q9GM33 


Q9gm33 macaca fasc 


11 


788 . 5 


13 


.3 


760 


13 


Q90638 


Q90638 gallus gall 


12 


779 


13 


. 2 


780 


11 


Q8K4S4 


Q8k4s4 mus musculu 


13 


778 


13 


. 1 


780 


11 


Q8K0T0 


Q8k0t0 mus musculu 


14 


699 


11 


. 8 


643 


11 


Q8CCU2 


Q8ccu2 mus musculu 


15 


688 


11 


. 6 


199 


4 


Q9BQ59 


Q9bq5 9 homo sapien 


16 


672 


1 1 


. 3 


208 


13 


Q90637 


Q90637 gallus gall 


1 / 


671 


1 1 


. 3 


267 


11 


Q63765 


Q63765 rattus sp. 


18 


626 


10 


. 6 


237 


11 


Q8C6D5 


Q8c6d5 mus musculu 


19 


623 . 5 


10 


. 5 


236 


11 


Q8VBU0 


Q8vbu0 rattus norv 


20 


566 


9 


. 6 


595 


5 


Q9VMV9 


Q9vmv9 drosophila 


2 1 


528 


8 


. 9 


234 


5 


Q9VMW3 


Q9vmw3 drosophila 


22 


528 


8 


. 9 


2484 


5 


Q9U347 


Q9u34 7 caenorhabdi 


23 


526 


8 


. 9 


224 


5 


Q9VMW1 


Q9vmwl drosophila 


24 


520 


8 


. 8 


222 


5 


Q9VMW4 


Q9vmw4 drosophila 


25 


519 . 5 


8 


. 8 


2607 


5 


Q23187 


Q2318 7 caenorhabdi 


26 


518 


8 , 


. 7 


202 


5 


Q9VMW2 


Q9vmw2 drosophila 


27 


357.5 


6 . 


. 0 


2768 


5 


Q9VC00 


Q9vc00 drosophila 


28 


341.5 


5 . 


. 8 


5412 


5 


Q9W596 


Q9w596 drosophila 


29 


340.5 


5 . 


. 7 


222 


5 


Q23188 


Q2318 8 caenorhabdi 


3 0 


339 


5 , 


. 7 


1150 


5 


Q8IMM6 


Q8imm6 drosophila 


3 1 


334 


5 , 


. 6 


16215 


5 


Q9NFS3 


Q9nfs3 drosophila 


32 


334 


5 . 


. 6 


18074 


5 


Q9I7U4 


Q9i7u4 drosophila 


33 


331 


5 . 


. 6 


7962 


4 


Q10465 


Q10465 homo sapien 


34 


331 


5 . 


. 6 


34350 


4 


Q8WZ42 


Q8wz42 homo sapien 


35 


325 . 5 


5 . 


. 5 


5327 


5 


076891 


076891 drosophila 


36 


322 . 5 


5 . 


4 


846 


5 


Q8MRP6 


Q8mrp6 drosophila 


37 


322 . 5 


5 . 


4 


1109 


5 


Q9VAY4 


Q9vay4 drosophila 




JlO 


c 

D . 






cr 

b 


vbLo 


Q9vgc8 drosophila 


39 


317.5 


5 . 


4 


4969 


11 


Q8CF91 


Q8cf91 mus musculu 


40 


317.5 


5 . 


4 


5165 


11 


Q8CF92 


Q8cf92 mus musculu 


41 


316 


5 . 


3 


842 


5 


Q9VGC9 


Q9vgc9 drosophila 


42 


315 


5 . 


3 


10578 


5 


Q8ISF5 


Q8isf 5 caenorhabdi 


43 


315 


5 . 


3 


18519 


5 


Q8ISF6 


Q8isf 6 caenorhabdi 


44 


315 


5 . 


3 


18534 


5 


Q8ISF7 


Q8isf7 caenorhabdi 


45 


314 


5 . 


3 


864 


5 


Q95U45 


Q95u45 drosophila 



ALIGNMENTS 



RESULT 1 
Q8IUA4 

ID Q8IUA4 PRELIMINARY; PRT; 986 AA. 

AC Q8IUA4; 

DT 01-MAR-2003 {TrEMBLrel . 23, Created) 

DT 01-MAR-2 003 (TrEMBLrel. 23, Last sequence update) 

DT 01-MAR-2003 (TrEMBLrel. 23, Last annotation update) 



DE RNT4 (RTN4) . 

GN RTN4 . 

OS Homo sapiens (Human) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

OX NCBI__TaxID= 9606; 

RN [1] 

RP SEQUENCE FROM N.A. 

RA Oertle T. , van der Putten H. , Schwab M.E.; 

RT "Genomic Structure and Functional Characterization of the Promoter 

RT Structures of Human and Mouse Nogo/Rtn-4."; 

RL Submitted (OCT-2002) to the EMBL / GenBank / DDB J databases. 

RN [2] 

RP SEQUENCE FROM N.A. 

RA Oertle T. , Schwab M.E.; 

RL Submitted (MAY-2002) to the EMBL/GenBank/DDB J databases. 

RN [3] 

RP SEQUENCE FROM N.A. 

RA Van der Putten H. ; 

RL Submitted (MAY-2002) to the EMBL/ GenBank/ DDB J databases. 

RN [4] 

RP SEQUENCE FROM N.A. 

RC TISSUE=TestiS; 

RX MEDLINE=22376540; PubMed=12488097 ; 

RA Oertle T. , van der Putten H. , Schwab M.E.; 

RT "Genomic Structure and Functional Characterization of the Promoter 

RT Structures of Human and Mouse Nogo/Rtn-4."; 

RL J. Mol. Biol. 325:299-323(2003). 

DR EMBL; AY102285; AAM64244.1; -. 

DR EMBL; AY123245; AAM64249.1; 

DR EMBL; AY123246; AAM64250.1; 

DR EMBL; AY123247; AAM64251.1; -. 

DR EMBL; AY123248; AAM64252.1; 

DR EMBL; AY123249; AAM64253.1; 

DR EMBL; AY123250; AAM64254.1; -. 

SQ SEQUENCE 986 AA; 108449 MW; 0CDE8F647 03 64 15A CRC64; 

Query Match 81.9%; Score 4851; DB 4; Length 986; 

Best Local Similarity 98.9%; Pred. No. 7.8e-234; 

Matches 976; Conservative 1; Mismatches 8; Indels 2; Gaps 2; 
Qy 193 MDLKEQPGNTISAGQEDFPSVLLETAASXPSLSPLSAASFKEHEYLGNLSTVLPTEGTLQ 252 

MIIIIMIIIIIIIIIIIIIIIIIIII IIIIIIIIIIIIIIIIMIIIIIIIIIIIII 

Db 1 MDLKEQPGNTISAGQEDFPSVLLETAASLPSLSPLSAASFKEHEYLGNLSTVLPTEGTLQ 60 

Qy 253 ENVSEASKEVSEKAKTLLIDRDLTEFSELEYSEMGSSFSVSPKAESAVIVANPREEIIVK 312 

II 1 1 1 M M 1 1 1 IM M Ml M I M 1 1 II II I M I M 1 1 1 Ml 1 1 1 MINI I 

Db 61 ENVSEASKEVSEKAKTLLIDRDLTEFSELEYSEMGSSFSVSPKAESAVIVANPREEIIVK 12 0 

Qy 313 NKDEEEKLVSNNILHXQQELPTALTKLVKEDEWSSEKAKDSFNEKRVAVEAPMREEYAD 372 

1 1 1 1 1 1 1 m i m M i 1 1 f r 1 1 1 1 1 1 f 1 1 1 1 1 1 1 1 f f i r i 1 1 j j i r r i ^ j 1 1 1 1 r r 1 1 1 1 

Db 121 NKDEEEKLVSNNILHNQQELPTALTKLVKEDEWSSEKAKDSFNEKRVAVEAPMREEYAD 18 0 

Qy 373 FKPFERVWEVKDSKEDSDMLAAGGKIESNLESKVDKKCFADSLEQTNHEKDSESSNDDTS 432 

MMMI MMIMMMMIMM IMMIII IMMMI MIMIII MMI 

Db 181 FKPFERVWEVKDSKEDSDMLAAGGKIESNLESKVDKKCFADSLEQTNHEKDSESSNDDTS 24 0 



Qy 433 FPSTPEGI KDRSGAYITCAPFNPAATESIATNI FPLLEDPTSENXTDEKKI EEKKAQI VT 492 

MIIIIIIINIMhMlllllllllll.il 1 1 1 1 1 1 I MMMMMM 

Db 241 FPSTPEGIKDRSGAYITCAPFNPAATESIATNIFPLLGDPTSENKTDEKKIEEKKAQIVT 300 

Qy 4 93 EKNTSTKTSNPFFVAAQDSETDYVTTDNLTKVTEEWANMPEGLTPDLVQEACESELNEV 552 

MINIMUM MM MMMMMIMI MMMMMM IMMMMM 

Db 301 EKNTSTKTSNPFLVAAQDSETDYVTTDNLTKVTEEWANMPEGLTPDLVQEACESELNEV 360 

Qy 553 TGTKIAYETKMDLVQTSEVMQESLYPAAQLCPSFEESEATPSPVLPDIVMEAPLNSAVPS 612 

1 1 1 1 1 1 1 1 1 1 E 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 r 1 1 1 1 1 1 J 

Db 361 TGTKIAYETKMDLVQTSEVMQESLYPAAQLCPSFEESEATPSPVLPDIVMEAPLNSAVPS 420 

Qy 613 AGASVIQPSSSPLEASSVNYESIKHEPENPPPYEEAMSVSL-KVSGIKEEIKEPENINAA 671 

III MIMMIMM IMIIIMMIMI MMI Mill MIMIIIII 

Db 421 AGASVIQPSSSPLEASSVNYESIKHEPENPPPYEEAMSVSLKKVSGIKEEIKEPENINAA 480 

Qy 672 LQETEAPYISIACDLIKETKLSAEPAPDFSDYSEMAKVEQPVPDHSELVEDSSPDSEPVD 731 

Mill 1 1 1 1 1 : 1 1 1 1 Mill 1 1 1 1 1 1 i 1 1 1 1 [ i 1 1 1 1 J 1 1 1 1 1 1 1 1 1 J 1 1 1 1 1 1 1 1 

Db 481 LQETEAPYISIACDLIKETKLSAEPAPDFSDYSEMAKVEQPVPDHSELVEDSSPDSEPVD 540 

Qy 732 LFSDDS I PDVPQKQDETVMLVKESLTETSFESMI EYENKEKLSALPPEGGKPYLESFKLS 791 

Mill : ! 1 1 1 1 . 1 : 1 i i , 1 1 1 1 1 1 : 1 i Ml III MMMMMM: IMMMMM 

Db 541 LFSDDS I PDVPQKQDETVMLVKESLTETSFESMI EYENKEKLSALPPEGGKPYLESFKLS 600 

Qy 792 LDNTKDTLLPDEVSTLS KKEKI PLQMEELSTAVYSNDDLF I S KEAQ I RETETFSDSS P I E 851 

1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 II 1 1 II 1 1 1 II 1 1 1 1 1 1 II 1 1 II M 1 1 1 1 1 1 1 1 1 1 1 1 

Db 601 LDNTKDTLLPDEVSTLSKKEKIPLQMEELSTAVYSNDDLFISKEAQIRETETFSDSSPIE 660 

Qy 852 IIDEFPTLISSKTDSFSKLAREYTDLEVSHKSEIANAPDGAGSLPCTELPHDLSLKNIQP 911 

MMMMMM llllll II MM Mill III! II II I M II 1 1 II I M II II II II I 

Db 661 IIDEFPTLISSKTDSFSKIAREYTDLEVSHKSEIANAPDGAGSLPCTELPHDLSLKNIQP 72 0 

Qy 912 KVEEKISFSDDFSKNGSATSKVLLLPPDVSALGHTQAEIESIVKPKVLEKEAEKKLPSDT 971 

M M M I II II II I I I II I M I I II I II I II I M 

Db 721 KVEEKI SFSDDFSKNGSATSKVLLLPPDVSALA-TQAEI ESI VKPKVLVKEAEKKLPSDT 779 

Qy 972 EKEDRSPSAIFSADLGKTSWDLLYWRDIKKTGWFGASLFLLLSLTVFSI VSVTAYIAL 1031 

II II 1 1 M II II h I MMMMMIMMMMIMMMMMMIMMMIMM 

Db 780 EKEDRSPSAIFSAELSKTSWDLLYWRDIKKTGWFGASLFLLLSLTVFSIVSVTAYIAL 839 

Qy 1032 ALLS VT I S FR I YKGVI QA I QKS DEGH P FRAYLES E VA I S E ELVQKYSNSALGHVNCT I KE 1091 

II I M 1 1 II I M I II 1 1 1 1 1 II 1 1 II I M 1 1 1 M 1 1 1 II I II I M I II 1 1 1 1 1 II I II II 

Db 84 0 ALLSVTI SFRI YKGVIQAI QKS DEGH P FRAYLES EVA I SE ELVQKYSNSALGHVNCT I KE 899 

Qy 1092 LRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGLTLLI LALI SLFS VPVI YERHQAQI DHY 1151 

MIIIIIMIIIIIMMIIIIMII IMIMIMMMMMMMIIMMMIMM 

Db 900 LRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGLTLLI LAL I SLFS VP VI YERHQAQI DHY 959 

Qy 1152 LGLANKNVKDAMAKIQAKI PGLKRKAE 1178 

II II I I I I II II II II II II I I II I I I 
Db 960 LGLANKNVKDAMAKIQAKI PGLKRKAE 986 



RESULT 2 
Q8BGM9 

ID Q8BGM9 PRELIMINARY; PRT; 1162 AA. 

AC Q8BGM9; 



DT 


01 -MAR- 2 003 (TrEMBLrel. 23, 


Created) 


DT 


01-MAR-2003 (TrEMBLrel. 23, 


Last sequence update) 


DT 


01-MAR-2003 (TrEMBLrel. 23, 


Last annotation update) 


DE 


RTN4 . 




GN 


RTN4 . 




OS 


Mus musculus (Mouse) . 




OC 


Eukaryota; Metazoa,- Chorda t a; Crania ta; Vertebra ta; Euteleostomi ; 


OC 


Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Mus . 


OX 


NCBI TaxID=10090; 




RN 


[1] 




RP 


SEQUENCE FROM N.A. 




RC 


STRAIN=12 9/SvcJ7, and 12 9SvcJ7; 


RA 


Oertle T. , van der Putten H 


. , Schwab M . E . ; 


RT 


"Genomic Structure and Functional Characterization of the Promoter 


RT 


Structures of Human and Mouse Nogo/Rtn-4 . " ; 


RL 


Submitted (OCT-2002) to the 


EMBL/GenBank/DDBJ databases. 


RN 


[2] 




RP 


SEQUENCE FROM N.A. 




RC 


STRAIN-129/SvcJ7, and 12 9SvcJ7; 


RA 


Oertle T. , Schwab M.E.; 




RL 


Submitted (MAY-2002) to the 


EMBL/GenBank/DDBJ databases. 


RN 


[3] 




RP 


SEQUENCE FROM N.A. 




RC 


STRAIN=129/SvcJ7; 




RA 


Van der Putten H.; 




RL 


Submitted (MAY-2002) to the 


EMBL/GenBank/DDBJ databases. 


KIN 


[4] 




RP 


SEQUENCE FROM N.A. 




RC 


STRAIN=129SvcJ7; 




RA 


Van der Putten H., Mir A. ; 




RL 


Submitted (MAY-2002) to the 


EMBL/GenBank/DDBJ databases. 


DR 


EMBL ; AY102284; AAM73506.1; 




DR 


EMBL; AY102286; AAM73511.1; 




SQ 


SEQUENCE 1162 AA; 126613 


MW; 855697FBEE11781F CRC64 ; 



Query Match 72.4%; Score 4289; DB 11; Length 1162; 

Best Local Similarity 73.9%; Pred. No. l.le-205; 

Matches 883; Conservative 109; Mismatches 153; Indels 50; Gaps 18; 
QY 1 MEDLDQSPLVSSS-DSPPRPQPAFKYQFVREPEDEEEEEEEEEEDEDEDLEELEVLERKP 59 

Mhlll Mill MINI llllllll MlllhMMIIhMIIIIMMIIIM 

Db 1 MEDIDQSSLVSSSADSPPRPPPAFKYQFVTEPEDEEDEEDEEEEEDDEDLEELEVLERKP 60 

Qy 60 AAGLSAAPVPTAPAAGAPLMDFGNDFVPPAPRGPLPAAPPVAPERQPSWDPSPVSSTVPA 119 

IIIIIMMI III llhll U I I I 'I ! MIIIMI: || :| | 

Db 61 AAGLSAAPVP- - PAA-APLLDFSSDSVPPAPRGPLPAAPPTAPERQPSWERSPAAS A 114 

Qy 12 0 PSPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPVWTPPAPAPAAPPSTPAAPKRR 17 9 

M MM MIMMMMM II II II M MMIMMIMM 

Db 115 PSLPPAAAVLPSKLPEDDEPPAR- - PPAPAGAS PLAE PAAPPSTPAAPKRR 163 

Qy 18 0 GSSG AWXXXXKI MDLKEQPGNTI SAGQEDFPSVLLETAASXPSLS 225 

M : ! 1 1 f 1 1 1 1 1 1 1 1 = I = I f 1 1 1 1 1 1 1 Mill MM 

Db 164 GSGSVDETLFALPAASEPVI PSSAEKIMDLKEQPGNTVSSGQEDFPSVLFETAASLPSLS 223 

Qy 226 PLSAAS FKEHE YLGNLS T VL PTEGTLQENVS EAS KE VS E KAKTLL I DRDLTE FS ELE YS E 285 



Db 



224 PLSTVSFKEHGYLGNLSAVASTEGTIEETLNEASRELPERATNPFVNRESAEFSVLEYSE 283 



Qy 


286 


MGSSFSVS PKAESAVI VANPREEI I VKNKDEEEKLVSNNI LHXQQELPTALTKLVKEDEV 

1 1 M h 1 1 II 1 h M 1 : 1 1 : 1 M M h h 1 1 = II III II h 1 1 1 1 1 

MGSSFNGSPKGESAMLVENTKEEVI VRSKDKED-LVCSAALHNPQESPATLTKWKEDGV 


345 


Db 


284 


342 


Qy 


346 


VSSEKAKDSFNEKRVAVEAPMREEYADFKPFERVWEVKDSKEDS-DMLAAGGKIESNLES 

= 1 || M 1 1 ::: 1 1 1 : I 1 1 I 1 1 1 1 1 1 1 : 1 1 1 1 I : 1 1 1 : 1 1 1 : 1 : 1 1 
1 1 1 1 III 1 II II 1 II 1 1 1 II 1 Mill 1 1 1 1 1 1 III 

MS PEKTMD I FNEMKMS WAPVREE YADFKPFEQAWEVKDTYEGSRDVLAA RANMES 


404 


Db 


343 


398 


Qy 


405 


KVDKKCFADSLEQTNHEKDSESSNDDTSFPSTPEGIKDRSGAYITCAPFNPAATESIATN 


464 


Db 


399 


1 1 1 1 1 1 II 1 1 1 1 | | | | | | | : : IN Ml : | | | | | || 1 M : I 1 1 1 II 
KVDKKCFEDSLEQKGHGKDSESRNENASFPRTPELVKDGSRAYITCDSFS-SATESTAAN 


457 


Qy 


465 


I FPLLEDPTSENXTDEKKI EEKKAQI VTEKNTSTKTSNPFFVAAQDSETDYVTTDNLTKV 


524 


Db 


458 


M H II 1 1 1 1 M 1 II 1 1 h 1 1 1 M II 1 II 1 II 1 1 1 II III 1 1 1 1 ; 1 1 M 1 

I FPVLEDHTSENKTDEKKI EERKAQI ITEK-TSPKTSNPFLVAIHDSEADYVTTDNLSKV 


516 


Qy 


525 


TEEWANMPEGLTPDLVQEACESELNEVTGTKIAYETKMDLVQTSEVMQESLYPAAQLCP 

II III 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 : 1 1 1 1 1 1 | MINI Mill 
TEAWATMPEGLTPDLVQEACESELNEATGTKIAYETKVDLVQTSEAIQESIYPTAQLCP 


584 


Db 


517 


576 


Qy 


585 


SFEESEATPSPVLPDIVMEAPLNSAVPSAGASVIQPSSSPLEA-SSVNYESIKHEPENPP 


643 


Db 


577 


| | | | : | | | | | | | | | | | | | | | | | | | : | | | | | | | | | : | | | | | | : | : | | | | | | | | 
SFEEAEATPSPVLPDIVMEAPLNSLLPSTGASVAQPSASPLEVPSPVSYDGIKLEPENPP 


636 


Qy 


644 


P YEEAMS VSLKVSG I KEE I KE PEN I NAALQETEAP YI S I ACDL I KETKLSAEPAPDFSDY 


703 


Db 


637 


1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 II 1 M III II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 | | : | : | | : | 
PYEEAMSVALKTSDSKEEIKEPESFNAAAQEAEAPYISIACDLIKETKLSTEPSPEFSNY 


696 


Qy 


704 


SEMAKVEQPVPDHSELVEDSSPDSEPVDLFSDDSI PDVPQKQDETVMLVKESLTETSFES 


763 


Db 


697 


| | : | | | : | | | | M | : | || | : | | | | M 1 M 1 1 II : 1 1 1 hi 1 1 1 - 1 1 1 II 1 1 1 • 
SEIAKFEKSVPDHCELVDDSSPESEPVDLFSDDSI PEVPQTQEEAVMLMKESLTEVS-ET 


755 


Qy 


764 


MI EYENKEKLSALPPEGGKPYLESFKLSLDNTKDTLLPDEVSTLSKKEKI PLQMEELSTA 


823 


Db 


756 


: : : : : | | : | | | | | | | | | | | | | : : | | | | : | ; | | : | | | | | | | | | : | | 
VTQHKHKERLSASPQEVGKPYLESFQPNLHITKDA-ASNEIPTLTKKETISLQMEEFNTA 


814 


Qy 


824 


VYSNDDLFISKEAQIRETETFSDSSPIEI IDEFPTLISSKTDSFSKLAREYTDLEVSHKS 


883 


Db 


815 


: | | | | | | Ml ::: | : | | | | | | | M 1 1 1 1 1 1 1 1 'hi II M 1 1 1 1 11 M II 
IYSNDDLLSSKEDKMKESETFSDSSPIEI IDEFPTFVSAKDDS PKEYTDLEVSNKS 


870 


Qy 


884 


EIANAPDGAGSLPCTELPHDLSLKNIQPKVEEKISFSDDFSKNGSATSKVLLLPPDVSAL 


943 


Db 


871 


MM || IUhlll HI | Ml : 1 h 1 1 h h 1 1 1 1 1 M II II 

EIANVQSGANSLPCSELPCDLSFKNTYPKDEAHV- -SDEFSKSRSSVSKVPLLLPNVSAL 


928 


Qy 


944 


GHTQAEIESI VKPKVLEKEAEKKLPSDTEKEDRSPSAIFSADLGKTSVVDLLYWRDIKKT 


1003 


Db 


929 


: 1 M -Mill IMhIIMMIIIIII = h MM 1 1 1 1 1 1 . 1 1 1 1 II 1 

-ESQIEMGNIVKPKVLTKEAEEKLPSDTEKEDRSLTAVLSAELNKTSVATOLLYWRDIKKT 


987 


Qy 


1004 


GWFGASLFLLLSLTVFS I VS VTAYI ALALLSVTI SFRI YKGVI QAI QKSDEGHPFRAYL 


1063 


Db 


988 


1 1 M 1 1 1 1 II II 1 1 II 1 1 1 1 1 1 M 1 1 II II II 1 II M II 1 II II 1 II 1 II 1 II 1 M II II 

GWFGASLFLLLSLTVFS I VSVTAYI ALALLSVTI SFRI YKGVI QAI QKSDEGHPFRAYL 


1047 


Qy 


1064 


ESEVAISEELVQKYSNSALGHVNCTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALFNG 


1123 


Db 


1048 


1 1 1 M 1 1 II 1 1 II 1 II II II 1 II hllllllMII MIIIMIII IIIIIIIMI 1 
ESEVAI SEELVQKYSNSALGHVNSTI KELRRLFLVDDLVDSLKFAVLMWVFTYVGALFNG 


1107 



Qy 1124 LTLLI LALI SLFS VPVI YERHQAQI DHYLGLANKN VKDAMAKI QAKI PGLKRKAE 1178 

IMIIIIIMIIhlllMIIIIMIIMIIIIhlMMIIIIIIIIIIIMII 

Db 1108 LTLLI LALI SLFS I PVI YERHQAQI DHYLGLANKS VKDAMAKI QAKI PGLKRKAE 1162 



RESULT 3 
Q8K3G8 



ID Q8K3G8 PRELIMINARY; PRT; 1163 AA. 

AC Q8K3G8; 

DT 01-OCT-2002 (TrEMBLrel . 22, Created) 

DT 01-OCT-2 002 (TrEMBLrel. 22, Last sequence update) 

DT 01-MAR-2 003 {TrEMBLrel. 23, Last annotation update) 

DE Nogo-A. 

GN RTN4 . 

OS Mus musculus (Mouse) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Mus. 

OX NCBI_TaxID=10090; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=BALB/c; 

RA Jin W . , Long M . , Li R . , Ju G . ; 

RT "Cloning and expression of the mouse Nogo-A protein."; 

RL Submitted (MAY-2002) to the EMBL/ GenBank/DDB J databases. 

DR EMBL; AY114152; AAM77068.1; 

DR MGD; MGI : 1915835 ; Rtn4 . 

DR InterPro; IPR003388; Reticulon. 

DR Pfam; PF02453; Reticulon; 1. 

DR PROSITE; PS5084 5; RETICULON; 1. 

SQ SEQUENCE 1163 AA; 126691 MW; 6B5F362 7994 17EA4 CRC64 ; 



Query Match 72.2%; Score 4277.5; DB 11; Length 1163; 

Best Local Similarity 73.8%; Pred. No. 3.9e-205; 

Matches 883; Conservative 108; Mismatches 154; Indels 51; Gaps 19; 



Qy 


l 


MEDLDQSPLVSSS-DSPPRPQPAFKYQFVREPEDEEEEEEEEEEDEDEDLEELEVLERKP 

IIMII Mill MINI MINIM ■ 1 H H 1 1 1 . 1 1 1 1 1 

MEDIDQSSLVSSSADSPPRPPPAFKYQFVTEPEDEEDEEDEEEEEDDEDLEELEVLERKP 


59 


Db 


i 


60 


Qy 


60 


AAGLSAAPVPTAPAAGAPLMDFGNDFVPPAPRGPLPAAPPVAPERQPSWDPSPVSSTVPA 
MINI III III llhll :| IIIIIIIIIMIII 11111111= II :| | 
AAGLSAVPVP- -PAA-APLLDFSSDSVPPAPRGPLPAAPPTAPERQPSWERSPAAS- - -A 


119 


Db 


61 


114 


Qy 


120 


PSPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPVWTPPAPAPAAPPSTPAAPKRR 


179 


Db 


115 


II MM MIIMMIMM II II M II MIIMIMIMM 

PSLPPAAAVLPSKLPEDDEPPAR- -PPAPAGASPLAE PAAPPSTPAAPKRR 


163 


Qy 


180 


GSSG A WXXXXKI MDLKEQPGNT I SAGQEDFPS VLLETAASXPSLS 


225 


Db 


164 


1 = IllllllllllhhIIIIIIIII Mill MM 

GSGSVDETLFALPAASEPVI PSSAEKIMDLKEQPGNTVSSGQEDFPSVLFETAASLPSLS 


223 


Qy 


226 


PLSAASFKEHEYLGNLSTVLPTEGTLQENVSEASKEVSEKAKTLLIDRDLTEFSELEYSE 


285 


Db 


224 


III Mill MUM 1 MlhM ::||hh hi III Mill 

PLSTVSFKEHGYLGNLSAVASTEGTIEETLNEASRELPERATNPFVNRESAEFSVLEYSE 


283 


Qy 


286 


MGSSFSVS PKAESAVI VANPREEI I VKNKDEEEKLVSNNI LHXQQELPTALTKLVKEDEV 


345 



Mllh III MhM I :|h||::||:|: || : || || | | |, | 

Db 284 MGSSFNGSPKGESAMLVENTKEEVI VRSKDKED-LVCSAALHNPQESPATLTKWKEDGV 342 

Qy 34 6 VSSEKAKDSFNEKRVAVEAPMREEYADFKPFERWEVKDSKEDS-DMLAAGGKIESNLES 4 04 

: I 1 1 I Ml ■- ■ - \ I h 1 1 1 1 1 1 1 1 1 1 h Mllh I I hill MM I 

Db 343 MSPEKTMDIFNEMKMSWAPVREEYADFKPFEQAWEVKDTYEGSRDVLAA RANMES 3 98 

Qy 405 KVDKKCFADSLEQTNHEKDS ES SNDDTSF PSTPEG I KDRSGAY I TCAPFNPAATES I ATN 464 

1 1 1 1 1 1 1 Mill M Mill h = II : 1 1 1 1 Ml I Mill I Mill I I 

Db 399 KVDKKCFEDSLEQKSHGKDSESRNENASFPSTPELVKDGSRAYITCDSFT-SATESTAAN 457 

Qy 4 65 I FPLLEDPTSENXTDEKKI EEKKAQI VTEKNTSTKTSNPFFVAAQDSETDYVTTDNLTKV 524 

hill 1 1 1 1 III! || 1 1 1 1 1 1 || Ml MIIMIhll 

Db 458 IFPVLEDHTSENKTDEKKIEERKAQIITEK-TSPKTSNPFLVAIHDSEADYVTTDNLSKV 516 

Qy 525 TEEWANMPEGLTPDLVQEACESELNEOTGTKIAYETKMDLVQTSEVMQESLYPAAQLCP 584 

M Ml MMMMMMMMMM IIMIIIIIhlllllll : I I I : I I Mill 
Db 517 TEAWATMPEGLTPDLVQEACESELNEATGTKIAYETKVDLVQTSEAIQESIYPTAQLCP 576 

Qy 585 SFEESEATPSPVLPDIVMEAPLNSAVPSAGASVIQPSSSPLEA-SSVNYESIKHEPENPP 643 

MlhlMMMMIIMMMM Ml MM llhllll I hh II .1111 

Db 577 SFEEAEATPSPVLPDIVMEAPLNSLLPSTGASVAQPSASPLEVPSPVSYDGIKLEPENPP 636 

Qy 644 P YEEAMSVSLKVSGI KEE I KEPENI NAALQETEAP YI S I ACDLI KETKLSAEPAPDFSDY 703 

MIIMIhll I lllllllh III II IIMIIIIIIIIIIIIII Ihl Ihl 

Db 63 7 PYEEAMSVALKTSDAKEEI KEPESFNAAAQEAEAPYI SI ACDLI KETKLSTEPSPGFSNY 696 

Qy 704 SEMAKVEQPVPDHSELVEDSSPDSEPVDLFSDDSIPDVPQKQDETVMLVKESLTETSFES 763 

MM! h MM llhlllhllllllllllllhIM hi llhllllll I h 

Db 697 SEIAKFEKSVPDHCELVDDSSPESEPVDLFSDDSI PEVPQTQEEAVMLMKESLTEVS-ET 755 

Qy 764 MIEYENKEKLSALPPEGGKPYLESFKLSLDNTKDTLLPDEVSTLSKKEKIPLQMEELSTA 823 

■ - - I h I II I I I I I I I M I : M III :|= Ihlll I Mill Mi 

Db 756 VTQHKHKERLSASPQEVGKPYLESFQPNLHITKDA-ASNEIPTLTKKETISLQMEEFNTA 814 

Qy 824 VYSNDDLFI SKEAQI RETETFSDSSPI EI I DEFPTLI SSKTDSFSKLAREYTDLEVSHKS 883 

■Mill; Ml -MMMMMIMMIMM MM II MIMIIIhll 

Db 815 I YSNDDLLSSKEDKMKESETFSDSSP I EI I DEFPTFVSAKDDS PKEYTDLEVSNKS 870 

Qy 884 EIANAPDGAGSLPCTELPHDLSLKNIQPKVEEKISFSDDFSKNGSATSKVLLLPPDVSAL 943 

MM M lllhlll III II Ml : Ihllh h Ml II hllll 

Db 871 EIANVQSGANSLPCSELPCDLSFKNTYPKDEAHV- -SDEFSKSRSSVSKVPLLLPNVSAL 928 

QY 944 GHTQAEIESIVKPKVLEKEAEKKLPSDTEKEDRSPSAIFSADLGKTSWDLLYWRDIKKT 1003 

= 1 I' =1111111 Ml hi I I llllll I II :|: Ihl I I || I MINIMI I I 
Db 929 -ESQIEMGNIVKPKVLTKEAEEKLPSDTEKEDRSLTAVLSAELNKTSWDLLYWRDIKKT 987 

Qy 1004 GW-FGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRI YKGVIQAIQKSDEGHPFRAY 1062 

Ml II 1 1 1 1 M II M II I M II M I II II I II I II 1 1 1 M II II 1 1 1 II M II II I II I 

Db 988 GWYFGASLFLLLSLTVFS I VS VTAYI ALALLS VTI SFRI YKGVI QAI QKSDEGHPFRAY 1047 

Qy 1063 LESEVAI SEELVQKYSNSALGHVNCTI KELRRLFLVDDLVDSLKFAVLMWVFTYVGALFN 1122 

1 1 M 1 1 1 1 II II I M II I M I II I M I M II I II 1 1 1 1 II II M II II 1 1 1 II II I M I 

Db 1048 LESEVAI SEELVQKYSNSALGHVNSTI KELRRLFLVDDLVDSLKFAVLMWVFTYVGALFN 1107 

Qy 1123 GLTLLI LALI SLFS VPVI YERHQAQ I DHYLGLANKNVKDAMAKI QAKI PGLKRKAE 1178 

lllllllllllllhlllllllllllllllllllhllllllllllllllllllll 



Db 1108 GLTLL I LALI SLFS I PVI YERHQAQI DHYLGLANKSVKDAMAKI QAKI PGLKRKAE 1163 



RESULT 4 
Q8BGK7 

ID Q8BGK7 PRELIMINARY; PRT; 1046 AA . 

AC Q8BGK7; 

DT 01-MAR-2003 (TrEMBLrel . 23, Created) 

DT 01-MAR-2 003 (TrEMBLrel . 23, Last sequence update) 

DT 01-MAR-2003 (TrEMBLrel- 23, Last annotation update) 

DE RTN4 . 

GN RTN4 . 

OS Mus musculus (Mouse) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Rodent ia; Sciurognathi ; Muridae; Murinae; Mus. 

OX NCBI_TaxID=10090; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=129/SvcJ7, and 129SvcJ7; 

RA Oertle T. , van der Putten H., Schwab M.E.; 

RT "Genomic Structure and Functional Characterization of the Promoter 

RT Structures of Human and Mouse Nogo/Rtn-4 . " ; 

RL Submitted (OCT-2 002) to the EMBL/ GenBank/DDBJ databases. 

RN [2] 

RP SEQUENCE FROM N.A. 

RC STRAIN=12 9/SvcJ7, and 129SvcJ7; 

RA Oertle T. , Schwab M.E.; 

RL Submitted (MAY-2002) to the EMBL/GenBank/DDBJ databases. 

RN [3] 

RP SEQUENCE FROM N.A. 

RC STRAIN-129/SvcJ7; 

RA Van der Putten H. ; 

RL Submitted (MAY-2002) to the EMBL/GenBank/DDBJ databases. 

RN [4] 

RP SEQUENCE FROM N.A. 

RC STRAIN=129SvcJ7; 

RA Van der Putten H. , Mir A. ; 

RL Submitted (MAY-2 002) to the EMBL/GenBank/DDBJ databases. 

DR EMBL; AY102280; AAM73502 . 1 ; -. 

DR EMBL; AY102286; AAM73507.1; -. 

SQ SEQUENCE 1046 AA; 114221 MW; 8CE2E2238ED51222 CRC64 ; 

Query Match 61.8%; Score 3658.5; DB 11; Length 1046; 

Best Local Similarity 69.5%; Pred. No. 2.6e-174; 

Matches 767; Conservative 108; Mismatches 148; Indels 81; Gaps 17; 

QY 97 APPVAPERQ PSWDPS PVSSTVPAP- - -SPLSAAAVSPSKLPED 136 

MM I H M I II I I I I I I 

Db 2 APPLAGGGQKGGAASEAWVPSLFVGVSGSTCTAAKSLVPIPARSSRLSAA 51 

Qy 137 DEPPARPPPPPPASVSPQAEPVWTPPAPAPAAPPSTPAAPKRRGSSGAWXXXXKIMDLK 196 

= I := II 1=1 I- = Illlll 

Db 52 RNETLFALPA ASEPVI PSSAE KIMDLK 78 

Qy 197 EQPGNTISAGQEDFPSVLLETAASXPSLSPLSAASFKEHEYLGNLSTVLPTEGTLQENVS 256 

MINI HIIMI I Mill Illlll Mill Illlll I ||||::| :: 
Db 7 9 EQPGNTVSSGQEDFPSVLFETAASLPSLSPLSTVSFKEHGYLGNLSAVASTEGTIEETLN 13 8 



Qy 


257 


EASKEVSEKAKTLLIDRDLTEFSELEYSEMGSSFSVSPKAESAVIVANPREEI IVKNKDE 


316 


Db 


139 


Mhh hi 1 1 1 1 1 1 1 I 1 I || h I I I | | | : = | | : | | : | | : : | | : 
EASRELPERATNPFVNRESAEFSVLEYSEMGSSFNGSPKGESAMLVENTKEEVIVRSKDK 


198 


Qy 


317 


EEKLVSNNILHXQQELPTALTKLVKEDEWSSEKAKDSFNEKRVAVEAPMREEYADFKPF 

h II : II II 1 MhllM hi II 1 III :::| Ihllllllllll 

ED - LVGSAALHNPQES PATLTKWKEDGVMS PEKTMD I FNEMKMS WAPVREEYADFKPF 


376 


Db 


199 


257 


Qy 


377 


ERVWEVKDSKEDS -DMLAAGGKI ESNLESKVDKKCFADSLEQTNHEKDSESSNDDTSFPS 

h llllh 1 1 hill ; Mill 1 Mill h: III 

EQAWEVKDTYEGSRDVLAA RANMESKVDKKCFEDSLEQKGHGKDSESRNENASFPR 


435 


Db 


258 


313 


Qy 


436 


TPEGIKDRSGAYITCAPFNPAATESIATNIFPLLEDPTSENXTDEKKIEEKKAQIVTEKN 4 95 
III .11 1 1 1 1 1 1 1 - .MM i 1 II M 1 1 l i i i i i i i i i i i mi 


Db 


314 


III ' ' "Mil 1 MM' 1 i 1 M M 1 M : 1 1 M : 1 II 

TPELVKDGSRAYI TCDSFS - SATESTAANI FPVLEDHTSENKTDEKKI EERKAQI I TEK- 


371 


Qy 


496 


TSTKTSNPFFVAAQDSETDYVTTDNLTKVTEEWANMPEGLTPDLVQEACESELNEVTGT 


555 


Db 


372 


II INI II Ml I II II II h 1 1 II III IMMMMMMMIMM Ml 

TSPKTSNPFLVAIHDSEADYVTTDNLSKVTEAWATMPEGLTPDLVQEACESELNEATGT 


431 


Qy 


556 


KI AYETKMDLVQTS E VMQESLYPAAQLCPSFEES EATPS P VLPD I VMEAPLNSAVPSAGA 


615 


Db 


432 


M 1 1 II h II 1 1 1 1 1 Mlhll MMMMhlMMMMMMMMM Ml II 

KIAYETKVDLVQTSEAI QES I YPTAQLCPSFEEAEATPS PVLPDI VMEAPLNSLLPSTGA 


491 


Qy 


616 


SVIQPSSSPLEA-SSVNYES I KHEPENPPPYEEAMSVSLKVSGI KEEI KEPENINAALQE 674 


Db 


492 


M 111 = 1111 1 hh II lllllllllllllhll 1 lllllllh III II 
SVAQPSASPLEVPSPVSYDGI KLEPENPPPYEEAMSVALKTSDSKEEI KEPESFMAAAQE 551 


Qy 


675 


TEAPYISIACDLIKETKLSAEPAPDFSDYSEMAKVEQPVPDHSELVEDSSPDSEPVDLFS 


734 


Db 


552 


MMIMMMMIMM Ihhihilhll 1 MM llhlllhllllllll 

AEAPYISIACDLIKETKLSTEPSPEFSNYSEIAKFEKSVPDHCELVDDSSPESEPVDLFS 


611 


Qy 


735 


DDSI PDVPQKQDETVMLVKESLTETSFESMI EYENKEKLSALPPEGGKPYLESFKLSLDN 


794 


Db 


612 


Mllhlll hi llhllllll 1 I- ::::|hlll I I Mlllllh M 
DDSIPEVPQTQEEAVMLMKESLTEVS-ETVTQHKHKERLSASPQEVGKPYLESFQPNLHI 


670 


Qy 


795 


TKDTLLPDEVSTLS KKEKI PLQMEELSTAVYSNDDLF I S KEAQ I RETETFSDS SPIEI ID 

Ml :h IMII 1 Mill MhMIMI III -MMMIMMIMM 

TKDA-ASNEI PTLTKKETI SLQMEEFNTAI YSNDDLLSSKEDKMKESETFSDSSPI EI ID 


854 


Db 


671 


729 


Qy 


855 


EFPTLISSKTDSFSKLAREYTDLEVSHKSEIANAPDGAGSLPCTELPHDLSLKNIQPKVE 

MM MM II MllllllhlllMI II 1 1 1 1 : 1 II III II II 1 


914 


Db 


730 


EF PTF VSAKDDS PKEYTDLEVSNKSEI ANVQSGANSLPCSELPCDLSFKNTYPKDE 


785 


Qy 


915 


E KI S FS DD F S KNG SATS KVLLL P PDVSALGHTQAE I ES I VKP KVL E KEAE KKL P S DT E KE 


974 


Db 


786 


: M h Ml || MINI :| h :||||||| 1 1 ' 1 1 

AHV- -SDEFSKSRSSVSKVPLLLPNVSAL-ESQIEMGNIVKPKVLTKEAEEKLPSDTEKE 


842 


Qy 


975 


DRSPSAI FSADLGKTSVVDLLYWRD I KKTGVVFGASLFLLLSLTVFSI VSVTAY I ALALL 

Ml : 1 : MM II 1 II 1 1 II II M M II 1 1 1 II 1 1 1 1 1 II 1 1 1 1 II II 1 II II II 1 1 1 

DRSLTAVLSAELNKTSWDLLYWRDI KKTGVVFGASLFLLLSLTVFSI VSVTAYIALiALL 


1034 


Db 


843 


902 


Qy 


1035 


S VTI SFRI YKGVI QAI QKSDEGHPFRAYLESEVAI SEELVQKYSNSALGHVNCTI KELRR 


1094 


Db 


903 


M II II 1 M II II 1 II 1 II 1 II 1 1 1 1 M II M II 1 1 1 1 1 1 II II 1 1 1 1 1 M 1 II, "1 

SVTI SFRI YKGVIQAIQKSDEGHPFRAYLESEVAISEELVQKYSNSALGHVNSTI KELRR 


962 



Qy 1095 LFLVDDLVDSLKFAVLMWVFTYVGALFNGLTLLILALI SLFSVPVI YERHQAQIDHYLGL 1154 

IIIIIIIIMIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIhlllllllllllllllll 

Db 963 LFLVDDLVDSLKFAVLMWVFTYVGALFNGLTLLILALISLFSIPVI YERHQAQIDHYLGL 1022 

Qy 1155 ANKNVKDAMAKI QAKI PGLKRKAE 1178 

llhllllllllllllllllllll 
Db 1023 ANKSVKDAMAKI QAKI PGLKRKAE 104 6 

RESULT 5 
Q8K2 9 0 

ID Q8K290 PRELIMINARY; PRT; 639 AA. 

AC Q8K290; 

DT 01-OCT-2002 (TrEMBLrel . 22, Created) 

DT 01-OCT-2 002 (TrEMBLrel. 22, Last sequence update) 

DT 01-MAR-2003 (TrEMBLrel . 23, Last annotation update) 

DE Hypothetical protein. 

GN RTN4 . 

OS Mus musculus (Mouse) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Rodent ia ; Sciurognathi ; Muridae; Murinae; Mus. 

OX NCBI_TaxID=10090 ; 

RN [1] 

RP SEQUENCE FROM N . A . 

RA Strausberg R. ; 

RL Submitted (JUN-2002) to the EMBL/ GenBank / DDB J databases. 

DR EMBL; BC032192; AAH32192.1; -. 

DR MGD; MGI : 1915835; Rtn4 . 

DR InterPro; IPR003388; Reticulon. 

DR Pfam; PF02453; Reticulon; 1. 

DR PROSITE; PS50845; RETICULON; 1. 

KW Hypothetical protein. 

SQ SEQUENCE 639 AA; 70312 MW; 309A19DA37603F11 CRC64 ; 

Query Match 42.7%; Score 2527; DB 11; Length 63 9; 

Best Local Similarity 79.5%; Pred. No. 3.6e-118; 

Matches 515; Conservative 57; Mismatches 66; Indels 10; Gaps 6; 
Qy 532 MPEGLTPDLVQEACESELNEVTGTKIAYETKMDLVQTSEVMQESLYPAAQLCPSFEESEA 591 

llllllllllllllllllll II M I MMM II 1 1 1 MIMM MIMIMM 

Db 1 MPEGLTPDLVQEACESELNEATGTKIAYETKVDLVQTSEAIQESI YPTAQLCPSFEEAEA 60 

QY 592 TPSPVLPDIVMEAPLNSAVPSAGASVIQPSSSPLEA-SSVNYESIKHEPENPPPYEEAMS 650 

IMIMIIIIIIIMII HI I I I I llhllll I hh || I I I I I I I I I I I | I 

Db 61 TPSPVLPDI VMEAPLNSLLPSTGASVAQPSASPLEVPSPVSYDGIKLEPENPPPYEEAMS 120 

Qy 651 VSLKVSGIKEEIKEPENINAALQETEAPYISIACDLIKETKLSAEPAPDFSDYSEMAKVE 710 

Mil I IMIMIM III II IIIIIIIIIIMIIIMI Ihhlhllhll I 

Db 121 VALKTSDAKEEIKEPESFNAAAQEAEAPYISIACDLIKETKLSTEPSPEFSNYSEIAKFE 180 

Qy 711 QPVPDHSELVEDSSPDSEPVDLFSDDSIPDVPQKQDETVMLVKESLTETSFESMIEYENK 770 

: MM II M I M I • I M II II II I M h 1 1 1 hi MIMIMM I I- --I 

Db 181 KSVPDHCELVDDSSPESEPVDLFSDDSIPEVPQTQEEAVMLMKESLTEVS-ETVTQHKHK 239 

Qy 771 EKLSALPPEGGKPYLESFKLSLDNTKDTLLPDEVSTLSKKEKI PLQMEELSTAVYSNDDL 830 

hill I I IMIMIh M Ml = h Ihlll I Mill MhlllMI 

Db 240 ERLSASPQEVGKPYLESFQPNLHITKDA-ASNEIPTLTKKETISLQMEEFNTAIYSNDDL 298 



Qy 


831 


FISKEAQIRETETFSDSSPIEI IDEFPTLI SSKTDSFSKLAREYTDLEVSHKSEIANAPD 


890 


Db 


299 


Ml — hMMMMMMIMM :hl II M 

LSSKEDKMKESETFSDSSPIEI IDEFPTFVSAKDDS PKEYTDLEVSNKSEIANVQS 


354 


Qy 


891 


GAGSLPCTELPHDLSLKNIQPKVEEKISFSDDFSKNGSATSKVLLLPPDVSALGHTQAEI 
II MM! IN || || I : IMIh 1 Ml || hill M 1 


950 


Db 


355 


GANSLPCSELPCDLSFKNTYPKDEAHV- -SDEFSKSRSSVSKVPLLLPNVSAL-ESQIEM 


411 


Qy 


951 


ESIVKPKVLEKEAEKKLPSDTEKEDRSPSAIFSADLGKTSWDLLYWRDIKKTGWFGAS 


1010 


Db 


412 


MINIM IIMMIIIIMI II :h Ihl llllllllllll III IMIIIII 

GNIVKPKVLTKEAEEKLPSDTEKEDRSLTAVLSAELNKTSVVDLLYWRDIKKTGVVFGAS 


471 


Qy 


1011 


LFLLLSLTVFSIVSVTAYIALA.LLSVTISFRIYKGVIQAIQKSDEGHPFRAYLESEVAIS 


1070 


Db 


472 


Ml Mill llllll III llllllllllllll lllllllllllllllllllllll MINI 

LFLLLSLTVFSIVSVTAYIALALLSVTISFRI YKGVIQAIQKSDEGHPFRAYLESEVAIS 


531 


Qy 


1071 


EELVQKYSNSALGHVNCTI KELRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGLTLLI LA 


1130 


Db 


532 


IIMIII 1 1 : 1 1 1 Ml MM 1 1 U M 1 M, 

EELVQKYSNSALGHVNSTI KELRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGLTLLI LA 


591 


Qy 


1131 


L I SLFS VP VI YERHQAQ I DHYLGLANKNVKDAMAKI QAKI PGLKRKAE 1178 




Db 


592 


MMMI Mill IMM Mill IMMMIMIIMM MM 

LI SLFS I P VI YERHQAQ I DHYLGLANKS VKDAMAKI QAKI PGLKRKAE 63 9 





RESULT 6 
Q96B16 

ID Q96B16 PRELIMINARY; PRT; 392 AA. 

AC Q96B16; 

DT 01-DEC-2001 (TrEMBLrel . 19, Created) 

DT 01-DEC-2001 (TrEMBLrel. 19, Last sequence update) 

DT 01-MAR-2003 {TrEMBLrel. 23, Last annotation update) 

DE Hypothetical protein (RTN4) . 

GN RTN4 . 

OS Homo sapiens (Human) . 

OC Eukaryota; Metazoa; Chorda ta; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

OX NCBI_TaxID=9606 ; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC TISSUE=Kidney ; 

RA Strausberg R.; 

RL Submitted (OCT-2001) to the EMBL / GenBank / DDB J databases. 

RN [2] 

RP SEQUENCE FROM N.A. 

RA Oertle T. , van der Putten H. , Schwab M.E.; 

RT "Genomic Structure and Functional Characterization of the Promoter 

RT Structures of Human and Mouse Nogo/Rtn-4 . " ; 

RL Submitted (OCT-2002) to the EMBL / GenBank / DDB J databases. 

RN [3] 

RP SEQUENCE FROM N.A. 

RA Oertle T. , Schwab M.E.; 

RL Submitted (MAY-2002) to the EMBL/GenBank/DDBJ databases. 

RN [4] 

RP SEQUENCE FROM N.A. 

RA Van der Putten H. ; 



RL Submitted (MAY-2 002) to the EMBL/ GenBank/ DDB J databases. 

RN [5] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=22376540; PubMed=1248 8 097 ; 

RA Oertle T. , van der Putten H., Schwab M.E.; 

RT "Genomic Structure and Functional Characterization of the Promoter 

RT Structures of Human and Mouse Nogo/Rtn-4 . " ; 

RL J . Mol. Biol. 325:299-323(2003). 

DR EMBL; BC016165; AAH16165.1; 

DR EMBL; AY102285 ; . AAM64242 . 1 ; -. 

DR EMBL; AY102278; AAM64247.1; -. 

DR InterPro; IPR003388; Reticulon. 

DR Pfam; PF02453; Reticulon; 1. 

DR PROSITE; PS50845; RETICULON; 1. 

KW Hypothetical protein. 

SQ SEQUENCE 392 AA; 42274 MW; D7B2AA5E83 9E58AD CRC64 ; 



Query Match 25.8%; Score 1531; DB 4; Length 3 92; 

Best Local Similarity 32.4%; Pred. No. 9e-69; 

Matches 382; Conservative 7; Mismatches 3; Indels 786; Gaps 4; 
Qy 1 MEDLDQSPLVSSSDSPPRPQPAFKYQFVREPEDEEEEEEEEEEDEDEDLEELEVLERKPA 6 0 

IIIIMMIIIIIIIIIIIIIIIIMMIIIIIIIIIIIIIIIMIIIIIIIIIIIIIII 

Db 1 MEDLDQSPLVSSSDSPPRPQPAFKYQFVREPEDEEEEEEEEEEDEDEDLEELEVLERKPA 6 0 

Qy 61 AGLSAAPVPTAPAAGAPLMDFGNDFVPPAPRGPLPAAPPVAPERQPSWDPSPVSSTVPAP 12 0 

II IMIMM IMIMI IMIMI IMIMIMM III II MUM lllllll I MM 

Db 61 AGLSAAPVPTAPAAGAPLMDFGNDFVPPAPRGPLPAAPPVAPERQPSWDPSPVSSTVPAP 12 0 

Qy 121 SPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPVWTPPAPAPAAPPSTPAAPKRRG 180 

i 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 M II 1 1 1 1 1 1 1 1 

Db 121 SPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPVWTPPAPAPAAPPSTPAAPKRRG 180 

QY 181 SSGAWXXXXKIMDLKEQPGNTISAGQEDFPSVLLETAASXPSLSPLSAASFKEHEYLGN 240 



Db 181 SSGSV 185 

QY 241 LSTVLPTEGTLQENVSEASKEVSEKAKTLLIDRDLTEFSELEYSEMGSSFSVSPKAESAV 300 

Db 186 185 

Qy 301 IVANPREEIIVKNKDEEEKLVSNNILHXQQELPTALTKLVKEDEWSSEKAKDSFNEKRV 360 

Db 186 185 

Qy 361 AVEAPMREEYADFKPFERVWEVKDSKEDSDMLAAGGKIESNLESKVDKKCFADSLEQTNH 42 0 

Db 186 185 

Qy 421 EKDSESSNDDTSFPSTPEGIKDRSGAYITCAPFNPAATESIATNIFPLLEDPTSENXTDE 480 

hi I 

Db 186 DETLF 190 

Qy 481 KKIEEKKAQIVTEKNTSTKTSNPFFVAAQDSETDYVTTDNLTKVTEEVVANMPEGLTPDL 540 

Db 191 190 



Qy 541 VQEACESELNEVTGTKIAYETKMDLVQTSEVMQESLYPAAQLCPSFEESEATPSPVLPDI 6 00 
Db 191 190 

Qy 601 VMEAPLNSAVPSAGASVIQPSSSPLEASSVNYESIKHEPENPPPYEEAMSVSLKVSGIKE 660 

1 = 1 = 1 

Db 191 ALPAA 195 

Qy 661 E I KEPEN I NAALQETEAP Y I S I ACDLI KETKLSAEPAPDFSD YS EMAKVEQP VPDHS ELV 72 0 

Db 196 195 

Qy 721 EDSSPDSEPVDLFSDDSI PDVPQKQDETVMLVKESLTETSFESMI EYENKEKLSALPPEG 78 0 

II II 

Db 196 SEPV 199 

Qy 781 GKPYLESFKLSLDNTKDTLLPDEVSTLSKKEKIPLQMEELSTAVYSNDDLFISKEAQIRE 840 
Db 200 199 

Qy 841 TETFSDSSPIEI IDEFPTLISSKTDSFSKLAREYTDLEVSHKSEIANAPDGAGSLPCTEL 900 
Db 200 199 

Qy 901 PHDLSLKNI QPKVEEKI S FSDDFS KNGSATSKVLLLP PDVSALGHTQAE I ES I VKPKVLE 960 
Db 200 199 

Qy 961 KEAEKKLPSDTEKEDRSPSAIFSADLGKTSWDLLYWRDIKKTGWFGASLFLLLSLTVF 102 0 

= -I II I I I ! I I I I I I' I II I I I I I || I I' I 

Db 2 00 1 RSSAWDLLYWRDI KKTGWFGASLFLLLSLTVF 234 

Qy 1021 SI VSVTAYIALALLSVTISFRI YKGVIQAIQKSDEGHPFRAYLESEVAISEELVQKYSNS 1080 

IIIMIIIIIIIIIIIIIIIIIIIMIMIIIIIIIIIIIIIIMMIIIIIIIMIMI 

Db 235 SIVSVTAYIAIALLSVTISFRI YKGVIQAIQKSDEGHPFRAYLESEVAISEELVQKYSNS 294 

Qy 1081 ALGHVNCTI KELRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGLTLLI LALI SLFSVPVI 114 0 

IMIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIII 

Db 295 ALGHVNCTI KELRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGLTLLILALI SLFSVPVI 354 

Qy 1141 YERHQAQ I DHYLGLANKNVKDAMAKI QAKI PGLKRKAE 1178 

i lillll IMIII MINI llllll IIIMM 

Db 355 YERHQAQ I DHYLGLANKNVKDAMAKI QAKI PGLKRKAE 392 

RESULT 7 
Q8BHF5 

ID Q8BHF5 PRELIMINARY; PRT; 375 AA. 

AC Q8BHF5; 

DT 01-MAR-2003 (TrEMBLrel . 23, Created) 

DT 01-MAR-2003 (TrEMBLrel. 23, Last sequence update) 

DT 01-MAR-2003 (TrEMBLrel. 23, Last annotation update) 

DE RTN4 . 

GN RTN4 . 

OS Mus mus cuius (Mouse) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
OC Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Mus. 



OX NCBI_TaxID=10090; 

RN [1] 

RP SEQUENCE FROM N . A. 

RC STRAIN=129/SvcJ7, and 129SvcJ7; 

RA Oertle T. , van der Putten H. , Schwab M.E.; 

RT "Genomic Structure and Functional Characterization of the Promoter 

RT Structures of Human and Mouse Nogo/Rtn-4 . " ; 

RL Submitted (OCT-2 0 02) to the EMBL / GenBank / DDB J databases. 

RN [2] 

RP SEQUENCE FROM N. A. 

RC STRAIN=12 9/ Svc J7 , and 129SvcJ7; 

RA Oertle T. , Schwab M.E.; 

RL Submitted (MAY-2002) to the EMBL/GenBank/DDBJ databases. 

RN [3] 

RP SEQUENCE FROM N . A . 

RC STRAIN=129/SvcJ7 ; 

RA Van der Putten H. ; 

RL Submitted (MAY-2002) to the EMBL/GenBank/DDBJ databases. 

RN [4] 

RP SEQUENCE FROM N . A . 

RC STRAIN=12 9SvcJ7; 

RA Van der Putten R\ , Mir A.; 

RL Submitted (MAY-2002) to the EMBL/GenBank/DDBJ databases. 

DR EMBL; AY102282; AAM73504.1; -. 

DR EMBL; AY102286; AAM73509.1; -. 

SQ SEQUENCE 375 AA; 40300 MW; 23D9EB19BE671AE6 CRC64 ; 

Query Match 20.3%; Score 1201.5; DB 11; Length 375; 

Best Local Similarity 28.6%; Pred. No. 2.3e-52; 

Matches 337; Conservative 17; Mismatches 20; Indels 805; Gaps 11; 

Qy 1 MEDLDQSPLVSSS-DSPPRPQPAFKYQFVREPEDEEEEEEEEEEDEDEDLEELEVLERKP 59 

Mhlll Mill MUM llllllll lllll|:|hlll|::||||||||||MII 

Db 1 MEDIDQSSLVSSSADSPPRPPPAFKYQFVTEPEDEEDEEDEEEEEDDEDLEELEVLERKP 60 

QY 60 AAGLSAAPVPTAPAAGAPLMDFGNDFVP PAPRGPLPAAP PVAPERQPSWDPS PVS STVPA 119 

IIIIIMIII III llhll : I Illlllllllllll lllllllh || M I 

Db 61 AAGLSAAPVP - - PAA-APLLDFSSDSVPPAPRGPLPAAPPTAPERQPSWERSPAAS A 114 

Qy 12 0 PSPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPVWTPPAPAPAAPPSTPAAPKRR 179 

M MM I I Ml II II || || Mini 

Db 115 PSLPPAAAVLPSKLPEDDEPPAR- -PPAPAGASPLAE PAAPPSTPAAPKRR 163 

Qy 180 GSSGAWXXXXKIMDLKEQPGNTISAGQEDFPSVLLETAASXPSLSPLSAASFKEHEYLG 239 

II 

Db 164 GS 165 

Qy 240 NLSTVLPTEGTLQENVSEASKEVSEKAKTLLIDRDLTEFSELEYSEMGSSFSVSPKAESA 299 

Db 166 165 

Qy 300 VI VANPREE 1 1 VKNKDEEEKLVSNN I LHXQQELPTALTKLVKEDE WSS EKAKDS FNEKR 359 

Db 166 1 65 



Qy 



3 60 VAVEAPMREEYADFKPFERVWEVKDSKEDSDMLAAGGKIESNLESKVDKKCFADSLEQTN 419 



Db 166 165 

Qy 420 HEKDSESSNDDTSFPSTPEGIKDRSGAYITCAPFNPAATESIATNIFPLLEDPTSENXTD 479 

Db 166 GSVDETLF 173 

Qy 480 EKKIEEKKAQIWEKNTSTKTSNPFFVAAQDSETDYVTTDNLTKVTEEWANMPEGLTPD 53 9 

Db 174 173 

Qy 540 LVQEACESELNEVTGTKIAYETKMDLVQTSEVMQESLYPAAQLCPSFEESEATPSPVLPD 599 

Db 174 173 

Qy 600 IVMEAPLNSAVPSAGASVIQPSSSPLEASSVNYESIKHEPENPPPYEEAMSVSLKVSGIK 659 

Db 174 ALPAA 178 

Qy 660 EE I KE PEN I NAALQETEAPY I S I ACDLI KETKLSAE PAPDFSDYS EMAKVEQP VPDHSEL 719 

Db 179 • -" 178 

Qy 720 VEDSSPDSEPVDLFSDDSI PDVPQKQDETVMLVKESLTETSFESMIEYENKEKLSALPPE 779 

I I I I 

Db 179 SEPV 182 

Qy 78 0 GGKPYLESFKLSLDNTKDTLLPDEVSTLSKKEKI PLQMEELSTAVYSNDDLFI SKEAQIR 83 9 

Db 183 182 

Qy 840 ETETFSDSSPIEI IDEFPTLISSKTDSFSKliAREYTDLEVSHKSEIANAPDGAGSLPCTE 899 

Db 183 182 

Qy 900 LPHDLSLKNIQPKVEEKISFSDDFSKNGSATSKVLLLPPDVSALGHTQAEIESIVKPKVL 959 

Db 183 182 

Qy 960 EKEAEKKLPSDTEKEDRSPSAIFSADLGKTSWDLLYWRDIKKTGWFGASLFLLLSLTV 1019 

Ul llllll lllllll MIINI MM 

Db 183 IPS SAWDLLYWRD I KKTGWFGASLFLLLSLTV 216 

Qy 1020 FSI VSVTAYIALALLSVTISFRI YKGVIQAIQKSDEGHPFRAYLESEVAISEELVQKYSN 1079 

MMMM MMM iMMI MMMM MMM: MMM MMMMMM 

Db 217 FSI VSVTAYIALALLSVTISFRI YKGVIQAIQKSDEGHPFRAYLESEVAISEELVQKYSN 276 

Qy 108 0 SALGHVNCTI KELRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGLTLLILALI SLFSVPV 113 9 

I I I I I I I I M I I I I I I M I I I I I I I I I I I I I M I I I I I I | | | | | | | | | | | | | | | | |: M 
Db 277 SALGHVNSTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGLTLLILALISLFSIPV 336 

Qy 114 0 I YERHQAQI DHYLGLANKNVKDAMAKI QAKI PGLKRKAE 1178 

IIIIIIMIIIIIIIIIhllllllllllllllllllll 

Db 337 I YERHQAQ I DHYLGLANKSVKDAMAKI QAKI PGLKRKAE 375 



RESULT 8 
Q8BH78 



ID Q8BH78 PRELIMINARY; PRT; 356 AA. 

AC Q8BH78; 

DT Ol-MAR-2003 (TrEMBLrel . 23, Created) 

DT Ol-MAR-2003 (TrEMBLrel. 23, Last sequence update) 

DT Ol-MAR-2003 (TrEMBLrel. 23, Last annotation update) 

DE RTN4 . 

GN RTN4 . 

OS Mus musculus (Mouse) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Mus. 

OX NCBI_TaxID=10090; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=129/SvcJ7, and 129SvcJ7; 

RA Oertle T. , van der Putten H., Schwab M.E.; 

RT "Genomic Structure and Functional Characterization of the Promoter 

RT Structures of Human and Mouse Nogo/Rtn-4 . " ; 

RL Submitted (OCT-2002) to the EMBL/ GenBank/DDB J databases. 

RN [2] 

RP SEQUENCE FROM N.A. 

RC STRAIN=129/SvcJ7, and 129SvcJ7; 

RA Oertle T. , Schwab M.E.; 

RL Submitted (MAY-2002) to the EMBL/GenBank/DDBJ databases. 

RN [3] 

RP SEQUENCE FROM N.A. 

RC STRAIN=129/SvcJ7; 

RA Van der Putten H. ; 

RL Submitted (MAY-2002) to the EMBL/GenBank/DDBJ databases. 

RN [4] 

RP SEQUENCE FROM N.A. 

RC STRAIN-12 9SVCJ7; 

RA Van der Putten H. , Mir A. ; 

RL Submitted (MAY-2002) to the EMBL/GenBank/DDBJ databases. 

DR EMBL; AY102281; AAM73503.1; -. 

DR EMBL; AY102286; AAM73508.1; -. 

SQ SEQUENCE 356 AA; 38403 MW; 4366C03BA963 0B56 CRC64; 



Query Match 19.6%; Score 1163; DB 11; Length 356; 

Best Local Similarity 27.7%; Pred. No. 1.8e-50; 

Matches 326; Conservative 12; Mismatches 17; Indels 824; Gaps 8; 
Qy 1 MEDLDQSPLVSSS-DSPPRPQPAFKYQFVREPEDEEEEEEEEEEDEDEDLEELEVLERKP 59 

II hill Mill III III llllllll lUMMIII-lll III III M 

Db 1 MEDIDQSSLVSSSADSPPRPPPAFKYQFVTEPEDEEDEEDEEEEEDDEDLEELEVLERKP 60 

Qy 6 0 AAGLSAAPVPTAPAAGAPLMDFGNDFVPPAPRGPLPAAPPVAPERQPSWDPSPVSSTVPA 119 

lllllllll! Ill llhll I Illlllllllllll Mlllllh II H I 

Db 61 AAGLSAAPVP- -PAA-APLLDFSSDSVPPAPRGPLPAAPPTAPERQPSWERSPAAS A 114 

Qy 120 PSPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPVWTPPAPAPAAPPSTPAAPKRR 179 

II 1 1 1 1 MM Ill M I II II II II Illlllllllllll 

Db 115 PSLPPAAAVLPSKLPEDDEPPAR- -PPAPAGASPLAE PAAPPSTPAAPKRR 163 

Qy 180 GSSGAWXXXXKIMDLKEQPGNTISAGQEDFPSVLLETAASXPSLSPLSAASFKEHEYLG 239 

II 

Db 164 GS 165 



24 0 NLS TVL PTEGTLQENVS EAS KE VS EKAKTLL I DRDLTEFSELEYSEMGSSFSVSPKAESA 299 
166 165 

300 VIVANPREEIIVKNKDEEEKLVSNNILHXQQELPTALTKLVKEDEWSSEKAKDSFNEKR 359 
166 1 65 

3 60 VAVEAPMREEYADFKPFERVWEVKDSKEDSDMLAAGGKIESNLESKVDKKCFADSLEQTN 419 

166 165 

42 0 HEKDSESSNDDTSFPSTPEGIKDRSGAYITCAPFNPAATESIATNIFPLLEDPTSENXTD 479 

166 165 

480 EKKIEEKKAQIVTEKNTSTKTSNPFFVAAQDSETDYVTTDNLTKVTEEWANMPEGLTPD 539 

166 165 

54 0 LVQEACESELNEWGTKIAYETKMDLVQTSEVMQESLYPAAQLCPSFEESEATPSPVLPD 59 9 

166 165 

600 IVMEAPLNSAVPSAGASVIQPSSSPLEASSWYESIKHEPENPPPYEEAMSVSLKVSGIK 659 

166 165 

660 EEIKEPENINAALQETEAPYISIACDLIKETKLSAEPAPDFSDYSEMAKVEQPVPDHSEL 719 

166 165 

72 0 VEDSSPDSEPVDLFSDDSIPDVPQKQDETVMLVKESLTETSFESMIEYENKEKLSALPPE 77 9 

166 165 

780 GGKPYLESFKLSLDNTKDTLLPDEVSTLSKKEKIPLQMEELSTAVYSNDDLFISKEAQIR 839 

166 165 

84 0 ETETFSDSS P I E 1 1 DEF PTL I SS KTDS FSKLARE YTDLE VSHKS E I ANAPDGAGSLPCTE 8 99 

Ih 

166 GSV 168 

900 LPHDLSLKNI QPKVEEKI SFSDDFSKNGSATSKVLLLPPDVSALGHTQAE I ES I VKPKVL 959 

169 168 

960 EKEAEKKLPSDTEKEDRSPSAI FSADLGKTSWDLLYWRDI KKTGWFGASLFLLLSLTV 1019 

M f 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

169 WDLLYWRDI KKTGWFGASLFLLLSLTV 197 

1020 FS I VS VTAYI ALALLS VTI SFRI YKGVI QAI QKSDEGHPFRAYLESEVAI SEELVQKYSN 1079 

MIIIIIIIIIIIIIIIMIMIIIIMIIIIIIMIIIIMIIIIIIIMMIIIMII 

198 FSIVSVTAYIALALLSVTISFRI YKGVI QAIQKSDEGHPFRAYLESEVAI SEELVQKYSN 257 
108 0 SALGHVNCTI KELRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGLTLLI LALI SLFS VPV 113 9 



Illllll II IIIIMIMIil MMIIIIIIMMII III 111 lllllhi 

Db 258 SALGHVNSTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGLTLLILALISLFSIPV 317 

Qy 114 0 I YERHQAQI DHYLGLANKNVKDAMAKI QAKI PGLKRKAE 1178 

IIUIMII IIIIMlllll llllilli II 

Db 318 I YERHQAQI DHYLGLANKSVKDAMAKI QAKI PGLKRKAE 356 



RESULT 
Q8K3G7 
ID 
AC 
DT 
DT 
DT 
DE 
GN 
OS 



Created) 

Last sequence update) 
Last annotation update) 



Q8K3G7 PRELIMINARY; PRT; 357 AA. 

Q8K3G7; 

01-OCT-2002 (TrEMBLrel . 22, 
01-OCT-2002 (TrEMBLrel. 22, 
01-MAR-2003 {TrEMBLrel. 23, 
Nogo-B . 
RTN4 . 

Mus musculus (Mouse) . 
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
OC Mammalia; Eutheria; Rodent ia; Sciurognathi ; Muridae; Murinae; Mus. 
OX NCBI_TaxID=1009 0; 
RN [1] 

RP SEQUENCE FROM N.A. 
RC STRAIN=BALB/C; 

RA Jin W., Li R., Long M., Shen J., Ju G. ; 

RT "Cloning and expression of the mouse Nogo-B protein."; 

RL Submitted (MAY-2002) to the EMBL/ GenBank/DDB J databases. 

DR EMBL; AY114153; AAM77069.1; -. 

DR MGD; MGI: 1915835; Rtn4 . 

DR InterPro; IPR003388; Reticulon. 

DR Pfam; PF02453; Reticulon; 1. 

DR PROSITE; PS50845; RETICULON; 1. 

SQ SEQUENCE 357 AA; 38566 MW; 73BB3D17DFDBDF15 CRC64 ; 



Query Match 19.5%; Score 1152.5; DB 11; Length 357; 

Best Local Similarity 27.6%; Pred. No. 6e-50; 

Matches 326; Conservative 12; Mismatches 17; Indels 825; Gaps 9; 
Qy 1 MEDLDQSPLVSSS-DSPPRPQPAFKYQFVREPEDEEEEEEEEEEDEDEDLEELEVLERKP 59 

llhlll Mill MINI IIIIIIII I I I II I. 'I I I 

Db 1 MEDIDQSSLVSSSADSPPRPPPAFKYQFVTEPEDEEDEEDEEEEEDDEDLEELEVLERKP 60 

Qy 60 AAGLSAAPVPTAPAAGAPLMDFGNDFVPPAPRGPLPAAPPVAPERQPSWDPSPVSSTVPA 119 

llllllllll III Mhll I Illlllllllllll lllllllh II H I 

Db 61 AAGLSAAPVP- -PAA-APLLDFSSDSVPPAPRGPLPAAPPTAPERQPSWERSPAAS A 114 

Qy 120 PSPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPVWTPPAPAPAAPPSTPAAPKRR 17 9 

II III! Ml III III II M II II IIIIIIIIIIIIM 

Db 115 PSLPPAAAVLPSKLPEDDEPPAR- -PPAPAGASPLAE PAAPPSTPAAPKRR 163 



Qy 

Db 



18 0 GSSGAWXXXXKI MDLKEQPGNTI SAGQEDF PSVLLETAASXPSLS PLSAAS FKEHEYLG 23 9 
164 GS 165 



Qy 



24 0 NLSTVLPTEGTLQENVSEASKEVSEKAKTLLI DRDLTEFSELEYSEMGSSFSVSPKAESA 299 



Db 



166 



165 



Qy 300 VI VANPREEI IVKNKDEEEKLVSNNILHXQQELPTALTKLVKEDEWSSEKAKDSFNEKR 359 

Db 166 165 

Qy 360 VAVEAPMREEYADFKPFERVWEVKDSKEDSDMLAAGGKIESNLESKVDKKCFADSLEQTN 419 

Db 166 165 

Qy 42 0 HEKDSESSNDDTSFPSTPEGIKDRSGAYITCAPFNPAATESIATNIFPLLEDPTSENXTD 479 

Db 166 165 

Qy 48 0 EKKIEEKKAQIVTEKNTSTKTSNPFFVAAQDSETDYVTTDNLTKVTEEWANMPEGLTPD 53 9 

Db 166 165 

Qy 540 LVQEACESELNEVTGTKIAYETKMDLVQTSEVMQESLYPAAQLCPSFEESEATPSPVLPD 599 

Db 166 165 

Qy 600 IVMEAPLNSAVPSAGASVIQPSSSPLEASSVNYESIKHEPENPPPYEEAMSVSLKVSGIK 659 

Db 166 165 

Qy 660 EEIKEPENINAALQETEAPYISIACDLIKETKLSAEPAPDFSDYSEMAKVEQPVPDHSEL 719 

Db 166 165 

Qy 720 VEDSSPDSEPVDLFSDDSIPDVPQKQDETVMLVKESLTETSFESMIEYENKEKLSALPPE 779 

Db 166 165 

Qy 78 0 GGKPYLESFKLSLDNTKDTLLPDEVSTLSKKEKIPLQMEELSTAVYSNDDLFISKEAQIR 83 9 

Db 166 165 

Qy 840 ETETFSDSSPIEI IDEFPTLISSKTDSFSKLAREYTDLEVSHKSEIANAPDGAGSLPCTE 899 

Ih 

Db 166 GSV 168 

Qy 900 LPHDLSLKNI QPKVEEKI SFSDDFSKNGSATSKVLLLPPDVSALGHTQAE I ES I VKPKVL 95 9 

Db 169 168 

Qy 960 EKEAEKKLPSDTEKEDRSPSAIFSADLGKTSWDLLYWRDIKKTGW-FGASLFLLLSLT 1018 

i-i 1 1 1 ii 1 1 1 1 ii 1 1 1 1 1 r 1 1 1 1 1 1 

Db 169 WDLLYWRDI KKTGVVYFGASLFLLLSLT 197 



Qy 


1019 


VFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRAYLESEVAISEELVQKYS 

UIIIIIIMMIII MINN MINN IMIIINMMIMIIIIII, Ml 


1078 


Db 


198 


VFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRAYLESEVAISEELVQKYS 


257 


Qy 


1079 


N S ALGH VN CT I KELRRLF LVDDLVDS LKFAVLMW VF T YVGALFNGLTLL I LALI SLFSVP 

MINIM IMIIMIIIIIII MMIII IMIIIMIIMIII MIIMI Ml 

NSALGH VNST I KELRRLFLVDDLVDS LKFAVLMW VFT YVGALFNGLTLL I LALI SLFS I P 


1138 


Db 


258 


317 



Qy 113 9 VI YERHQAQ I DHYLGLANKN VKDAMAKI QAKI PGLKRKAE 1178 

MIIMIIMIIIIIIIIhllllMIIIIMIIMIIII 

Db 318 VI YERHQAQI DHYLGLANKS VKDAMAKI QAKI PGLKRKAE 357 

RESULT 10 
Q9GM33 



ID Q9GM33 PRELIMINARY; PRT; 179 AA. 

AC Q9GM33; 

DT 01-MAR-2 001 (TrEMBLrel . 16, Created) 

DT 01-MAR-2001 (TrEMBLrel. 16, Last sequence update) 

DT 01-MAR-2 003 (TrEMBLrel. 23, Last annotation update) 

DE Hypothetical 19.9 kDa protein. 

OS Macaca fascicularis (Crab eating macaque) (Cynomolgus monkey) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Primates; Catarrhini; Cercopithecidae; 

OC Cercopithecinae; Macaca. 

OX NCB I_Tax I D= 9 5 4 1 ; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC TISSUE=Brain ; 

RA Osada N. , Hida M., Kusuda J., Tanuma R. , Iseki K. , Hirai M. , Terao K: , 

RA Suzuki Y., Sugano S., Hashimoto K. ; 

RT "Isolation of full-length cDNA clones from macaque brain cDNA 

RT libraries. "; 

RL Submitted (OCT-2000) to the EMBL/ GenBank / DDB J databases. 

DR EMBL; AB049853; BAB16739.1; -. 

DR InterPro; IPR003388; Reticulon. 

DR Pfam; PF02453; Reticulon; 1. 

DR PROSITE; PS50845; RETICULON; 1. 

KW Hypothetical protein. 

SQ SEQUENCE 179 AA; 19949 MW; 5F8CD43 83FEE9E02 CRC64 ; 



Query Match 14.6%; Score 8 67; DB 6; Length 179; 

Best Local Similarity 98.3%; Pred. No. 4.1e-36; 

Matches 176; Conservative 3; Mismatches 0; Indels 0; Gaps 0; 



Qy 


1000 


I KKTGWFGASLFLLLSLTVFS I VSVTAYI ALALLS VTI SFRI YKGVI QAI QKSDEGHPF 


1059 


Db 


l 


MKKTGVVFGASLFLLLSLTVFS I VSVTAYIALALLSVTI SFRI YRGVI QAI QKSDEGHPF 


60 


Qy 


1060 


RAYLESEVAI SEELVQKYSNSALGHVNCTI KELRRLFLVDDLVDSLKFAVLMWVFTYVGA 


1119 


Db 


61 


IMIIIIIMIIIIIIIIIIIIIMMIIIIIIIMIIIIIIMIhlllMIIIIIIM 

RAYLESEVAI SEELVQKYSNSALGHVNCTI KELRRLFLVDDLVDSLRFAVLMWVFTYVGA 


120 


Qy 


1120 


LFNGLTLLILALISLFSVPVI YERHQAQI DHYLGLANKNVKDAMAKI QAKI PGLKRKAE 1178 

MMIIIIIIIIMIMIIIIIMIIIIMIIMIIIIIMMIIIIMIIIIMMII ' 


Db 


121 


LFNGLTLLI LALI SLFS VPVI YERHQAQI DHYLGLANKNVKDAMAKI QAKI PGLKRKAE 17 9 



RESULT 11 
Q90638 

ID Q90638 PRELIMINARY; PRT; 760 AA. 

AC Q90638; 

DT 01 -NOV- 1996 (TrEMBLrel. 01, Created) 

DT 01 -NOV- 1996 (TrEMBLrel. 01, Last sequence update) 

DT 01-MAR-2003 (TrEMBLrel. 23, Last annotation update) 



DE ChS-Rex-b. 

OS Gallus gallus (Chicken) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Archosauria; Aves; Neognathae; Galliformes; Phasianidae; Phasianinae; 

OC Gallus. 

OX NCBI_TaxID=9031; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC TISSUE=Brain; 

RX MEDLINE=96386034; PubMed=87 93864 ; 

RA Baka I.D., Ninkina N.N., Pinon L.G., Adu J. # Davies A.M., 

RA Georgiev G.P., Buchman V.L.; 

RT "Intracellular compartmentalization of two differentially spliced s- 

RT rex/NSP mRNAs in neurons . " ; 

RL Mol. Cell. Neurosci. 7:28 9-303(1996). 

RN [2] 

RP SEQUENCE FROM N.A. 

RC TISSUE=Brain ; 

RX MEDLINE-97183663; 

RA Ninkina N.N., Baka I.D., Buchman V.L.; 

RT "Rat and chicken s-rex/NSP mRNA: nucleotide sequence of main 

RT transcripts and expression of splice variants in rat tissues. "; 

RL Gene 184:205-210(1997). 

DR EMBL; U17606; AAC60075.1; 

DR InterPro; IPR003388; Reticulon. 

DR Pfam; PF02453; Reticulon; 1. 

DR PROSITE; PS50845; RETICULON; 1. 

SQ SEQUENCE 760 AA; 82502 MW; 4 65C1B42 9F799D5C CRC64 ; 

Query Match 13.3%; Score 788.5; DB 13; Length 760; 

Best Local Similarity 27.7%; Pred. No. 2.4e-31; 

Matches 296; Conservative 115; Mismatches 315; Indels 341; Gaps 38; 

' P P P PAS VS PQAE PVWTP PAPAPAAP PSTPAAPKRRGSSGAWXXXXKI MDLKEQPGNT 2 02 

I ^ II I II I I II :| 

'DPQDLLLAGTAER-W AAAGAD E YAAGAALRDGDGA QQREQL 4 5 

AGQEDFPSVLLETAASXPSLSPLSAASFKEHEYLGNLSTVLPTEGTLQENVSEASKEV 262 

: I I : I I I! II ■■ ■ I hi =::| 

GSAREHPPVAMATA SPGVTASSRLFDY GSSSANGADSSFYT 88 

:KAKTLLID RDLTEFSELEYSEMGSSFSVSPKAESAVI VANPREEI IVKNKDE 316 

hi II I h : I : I II : =1 I 



I I = =11 I Ih I : 

LDSGIEMTPAESA EVDKSLTDPMKVEGYKYMDI 178 



Qy 


143 


Db 


4 


Qy 


203 


Db 


46 


Qy 


263 


Db 


89 


Qy 


317 


Db 


140 


Qy 


377 


Db 


179 


Qy 


435 


Db 


207 



II 

- YQEKHDPDSEDES PDL I DEYR 206 



hhl I M 



Qy 495 NTSTKTSNPFFVAAQDSETDYVTTDNLTKVTEEWANM PE GLTP 538 

I I I I I = I = III : II I I 

Db 254 VKITLTETPGAREATSKEASVTQPKSGLKPSHEWPTVMVSEPEDDSPGSVTPPSSGTEP 313 

Qy 53 9 DLVQEACESELNEVTGTKIAYETKMDLVQTSEVMQESLYPAAQLCPSFEESEATPSPVLP 598 

: II Mill I II I h I II 

Db 314 SGSESQGKGSLSEDELISAIKEAKGFSFETSEVQQS PAV SAEKQEQKMKPGRP 366 

Qy 599 DIVMEAPLNSAVPSA- -GASVTQ- PSSSPLEASSV NYESIKHEPENPPPYEEAMSVS 652 

' Ml- II I I h I II I I || : | Ml I I 

Db 367 - -AVPSPLDNEASSAESGDSEIELVSEDPLAAEEVLHSNYMTFSH- IGGPPP- -SPASPS 421 

Qy 653 LKVSGIKEEIKEPENINAALQETEAPYISIACDLIKETKLSAEPAPDFSDYSEMAKVEQP 712 
Db 422 IQYSILREERE AELDSELI IESCD ASSASEESP KREQD 459 

Qy 713 VPDHSELVEDSSPDSEPVDLFSDDSIPDVPQKQDETVMLVKESLTETSFESMIEYENKEK 772 

I =1 I -II II : 

Db 460 S PLMKPMVMD IIKE ENSSR 478 

Qy 773 LSALPPEGGKPYLESFKLSLDNTKDTLLPDEVSTLSKKEKI PLQMEELSTAVYSNDDLFI 832 

II I . 

Db 47 9 ASASDYE 48 5 

Qy 833 SKEAQIRETETFSDSSPIEI IDEFPTLISSKTDSFSKLAREYTDLEVSHKSEIANAPDGA 8 92 

MM I- II IN 
Db 486 AS KTTE - S RMNRE N LADS A 503 

Qy 893 GSLPCTELPHDLSLKNIQPKVEEKISFSDDFSKNGSATSKVLLLPPDVSALGHTQAEIES 952 

I ' : M I : III::: 
Db 504 SYLKSS FVAPKVSSE PPTSAVSTEELKERI 533 

Qy 953 I VKPKVLEKEAEKKLPSDTEKEDRSPSAI -FSADLGKTSWDLLYWRDI KKTGWFGASL 1011 

I : I : I I : II I I : II -lllllllhlhllh I 

Db 534 I LKKP I E ETVVNQS KVS SKDSGKRSPLALPLLPFLNKQKA I NLLYWRD I KQTG I VFGSLL 593 

Qy 1012 FLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRAYLESEVAISE 1071 

M Ml I hill I hill M I I II III I hi hi hi M I Ih II h h :h 
Db 594 LLLFSLTQFSWSWAYLALAGLSATISFRIYKSVLQAVQKTDEGHPFKAYLDMEMNLSQ 653 

Qy 1072 ELVQKYSNSALGHVNCTI KELRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGLTLLI LAL 1131 

= = 111- Ml hlMMIMI MIMMIIIIIh IIIIIIIIMIIIhh 

D] 3 654 DQIQKYTDCLQLYVNSTVKELRRLFLVQDLVDSLKFAVLMWLLTYVGALFNGLTLLIMAV 713 

Qy 1132 1 SLFS VPVI YERHQAQ I DHYLGLANKNVKDAMAKI QAKI PGLKRKAE 1178 

H : h : I h h : : I I II I I I I I - Mllllllll Mill 
Db 714 VSMFTLPWYDKYQAQIDQYLGLWTHINTWAKI QAKI PGAKRKAE 760 



RESULT 12 
Q8K4S4 

ID Q8K4S4 PRELIMINARY; PRT; 780 AA. 

AC Q8K4S4; 

DT 01-OCT-2002 (TrEMBLrel . 22, Created) 
DT 01-OCT-2002 (TrEMBLrel . 22, Last sequence update) 
DT 01-MAR-2003 (TrEMBLrel. 23, Last annotation update) 
DE Reticulon 1A. 



GN RTN1 OR RTN-1A. 

OS Mus musculus (Mouse) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Mus. 

OX NCBI_TaxID=10090; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=ICR; TISSUE=Brain ; 

RA Hirata T. , Nomura T. , Takagi Y . , Sato Y., Tomioka N. , Fujisawa H. , 

RA Osumi N . ; 

RT "Mosaic development of the olfactory cortex with Pax6 -dependent and 

RT -independent components."; 

RL Submitted (NOV-2001) to the EMBL/ GenBank/DDBJ databases. 

DR EMBL; AB074899; BAB96551.1; 

DR MGD; MGI: 1933947; Rtnl . 

DR InterPro; IPR001951; Histone_H4 . 

DR InterPro; IPR003388; Reticulon. 

DR Pfam; PF02453; Reticulon; 1. 

DR PROSITE; PS00047; HIST0NE_H4; 1. 

DR PROSITE; PS50845; RETICULON; 1. 

SQ SEQUENCE 780 AA; 83504 MW; 545F5638C576A069 CRC64; 

Query Match 13.2%; Score 779; DB 11; Length 780; 

Best Local Similarity 30.8%; Pred. No. 7.5e-31; 

Matches 236; Conservative 108; Mismatches 227; Indels 194; Gaps 28; 

QY 53 8 PDLVQEACESELNEVTG TKIAYETKMDLVQTSEVMQ-ESLYPAAQLCPSFEES 589 

II : = I I : I : : I : I = ::| h : | || 

Db 86 PDALDHSPSSTLKDGEGACYTSLI SDVCYPPREDSAYFTGI LQKENGHI TTSES P - - EEP 143 

Qy 590 EATPSPVLPDIVMEAP LNS AVPSAGASVIQPSSSPLE ASSVNY 632 

I II I II- I hi h I : : lh | : | 

Db 144 E-TPGPSLPEVPGMEPQGLLSSDSGIEMTPAESTEVNKILADPLDQMKAEAYKYIDITRP 202 

Qy 633 ESI KHEPENPPPYEE AMS VSLKVSGI KEE I KEPEN I NAAL QETEA 677 

1 I 1 I I I : M I I- =1 = I : I I 

Db 2 03 QEAKGQEEQHPGLEDKDLDFKDKGTEVSTKAEGVRAP -NQPAPVEGKLI KDHLFEESTFA 2 61 

Qy 678 PYISIACDLIKETKLSAEPAPDFSDYSE MAKVEQPVPDHSELVEDSSPDSEPVDL 732 

Ml I - -I II *l I - :|: :| |||: | 

Db 262 PYIDELSD- -EQHRVSLVTAPVKITLTEIEPPLMTATQETI PEKQDLCLKPSPDTVPTVT 319 

Qy 733 FS DDS IPDVPQKQ DET VMLVKESLTETSFESMI EYENKE 771 

I III I : I || : :||: : || | 

Db 320 VSEPEDDSPGSVTPPSSGTEPSAAESQGKGSVSEDELIAAIKEA KGLSYETTE 372 

Qy 772 KLSALPPEGGKPYLESFKLSLDNTKDTLLPDEVSTLSKKEKI PLQMEELSTAVYSNDDLF 831 

M = I I || :::::: | | 
Db 373 SPRPVGQVADKP KTKTRSGLPTI PSPLDQEASSAESGD 410 

Qy 832 ISKEAQIRETETFSDSSPIEI IDEFPTL 1 SSKTDSFSKLAREYTDL 877 



Db 



411 



* I I • • • I - I 

SEIELVSEDPMASEDALPSGYVSFGHVSGPPPSPASPSIQYSIL 454 



Qy 



878 



EVSHKSEIANAPDGAGSLP 



CTELPHDLS 905 



Db 



455 REEREAELDSELI IESCDASSASEESPKREQDSPPMKPGALDAIREETGSRATEERAPSH 514 



Qy 906 LKN I Q P KVE EKISFSDDFS KNGS ATS KVLLL P PD VS ALGHTQAE I E S I VKP KVL E KEAE K 965 
Db 515 QGPVEP- -DPMLSF APAAALQSRPEPSS GDGASVPEP PRSQQQ 555 

Qy 966 KLPSDTEKEDRSPSAI -FSADLG KTSWDLLYWRDI KKTGWFGASLFL 1013 

I ^ H I : I II | -I II I M I U hi I I- | 

Db 556 KPEEEAVSSSQSPTATEI PGPLGSGLMPPLPFFNKQKAIDLLYWRDIKQTGI VFGSFLLL 615 

Qy 1014 LLSLTVFS I VS VTAYI ALALLSVTI SFRI YKG VI QAI QKSDEGHPFRAYLESEVAI SEEL 1073 

I Ml Ihlll Ihlll II IMIIIII hlhlhlllllhllll h MM 
Db 616 LFSLTQFSWSWAYLALAALSATISFRI YKSVLQAVQKTDEGHPFKAYLELEITLSQEQ 675 

Qy 1074 VQKYSNSALGHVNCTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGLTLLILALIS 1133 

: I I I : : HI hlllllllll IMIIIMIIIIh I I I II I I | | I | | h = h = | 

Db 676 IQKYTDCLQLYVNSTLKELRRLFLVQDLVDSLKFAVLMWLLTYVGALFNGLTLLLMAWS 735 

Qy 1134 LFSVPVI YERHQAQIDHYLGLANKNVKDAMAKIQAKIPGLKRKAE 1178 

= 1-11 = 1 MIMM MM :: MMIIMM M II 

Db 736 MFTLPVVYVKHQAQVDQYLGLVRTHI NTWAKIQAKI PGAKRHAE 78 0 

RESULT 13 
Q8K0T0 

ID Q8K0TO PRELIMINARY; PRT; 780 AA. 

AC Q8K0T0; 

DT 01-OCT-2002 (TrEMBLrel . 22, Created) 

DT 01-OCT-2002 (TrEMBLrel. 22, Last sequence update) 

DT 01-MAR-2003 (TrEMBLrel. 23, Last annotation update) 

DE Similar to reticulon 1. 

GN RTN1 . 

OS Mus musculus (Mouse) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Mus . 

OX NCBI_TaxID=10090; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC TISSUE-Retina; 

RA Strausberg R. ; 

RL Submitted (MAY-2002) to the EMBL/GenBank/DDBJ databases. 

DR EMBL; BC030455; AAH30455.1; -. 

DR MGD; MGI: 1933947; Rtnl . 

DR InterPro; IPR001951; Histone_H4 . 

DR InterPro; IPR003388; Reticulon. 

DR Pfam; PF02453; Reticulon; 1. 

DR PROSITE; PS00047; HISTONE_H4; 1. 

DR PROSITE; PS5084 5; RETICULON; 1. 

SQ SEQUENCE 780 AA; 83572 MW; 2 9B4 7A58FC2F2 02 7 CRC64 ; 



Query Match 13.1%; Score 778; DB 11; Length 78 0; 

Best Local Similarity 30.8%; Pred. No. 8.4e-31; 

Matches 236; Conservative 108; Mismatches 227; Indels 194; Gaps 28; 

Qy 53 8 PDL VQEACES ELNEVTG TKIAYETKMDLVQTSEVMQ-ESLYPAAQLCPSFEES 58 9 

II : = I I : I : : I : I • "I \ ■ ■ III 

Db 8 6 PDALDHS PS STLKDGEGACYTSLI SDVCYP PREDSAYFTG I LQKENGH I TTS ESP - - EE P 143 



Qy 590 EATPSPVLPDIVMEAP LNS AVPSAGASVIQPSSSPLE- - -ASSVNY 632 

I II I lh: I hi h | : : ||: | : | 

Db 144 E-TPGPSLPEVPGMEPQGLLSSDSGIEMTPAESTEVNKILADPLDQMKAEAYKYIDITRP 202 

QY 633 ESIKHEPENPPPYEEA M S VS L KVS GIKEEIKEPENI NAAL QETEA 677 

: I : I I I : II I I- :| : | : | | 

Db 203 QEAKGQEEQHPGLEDKDLDFKDKDTEVSTKAEGVRAP-NQPAPVEGKLIKDHLFEESTFA 261 

QY 678 PYISIACDLIKETKLSAEPAPDFSDYSE MAKVEQP VPDHS ELVEDS S PDS EP VDL 732 

Ml I - -I || :| | :: :|: :| |||: | 

Db 262 P Y I DELSD - - EQHRVSLVTAPVKI TLTE I EP PLMTATQETI PEKQDLCLKPS PDTVPTVT 319 

Qy 733 FS DDS IPDVPQKQ DETVMLVKESLTETSFESMI EYENKE 771 

I III I : I II : :|h : || | 

Db 320 VSEPEDDSPGSVTPPSSGTEPSAAESQGKGSVSEDELIAAIKEA KGLSYETTE 372 

Qy 772 KLSALPPEGGKPYLESFKLSLDNTKDTLLPDEVSTLSKKEKIPLQMEELSTAVYSNDDLF 831 

Db 373 SPRPVGQVADKP KTKTRSGLPTI PS PLDQEASSAESGD 410 

Qy 832 ISKEAQIRETETFSDSSPlEI IDEFPTL 1 SSKTDSFSKLAREYTDL 877 

I II- II :| | : : :|= | 

Db 411 SEIELVSEDPMASEDALPSGYVSFGHVSGPPPSPASPSIQYSIL 454 

Qy 878 EVSHKSEIANAPDGAGSLP CTELPHDLS 905 

1111=1 hi M 

Db 455 REEREAELDSELI I ESCDASSASEESPKREQDSPPMKPGALDAI REETGSRATEERAPSH 514 

Qy 906 LKNIQPKVEEKISFSDDFSKNGSATSKVLLLPPDVSALGHTQAEIESIVKPKVLEKEAEK 965 

= = I = = I I i 1=1 I : | = : | : : | ce 45 : : 
Db 515 QGPVEP- -DPMLSF APAAALQSRPEPSS GDGASVPEP PRSQQQ 555 

Qy 966 KLPSDTEKEDRSPSAI -FSADLG KTSWDLLYWRDI KKTGWFGASLFL 1013 

I : =11 = 1 II I : I I I I I I I I I : I I : | | | : | | 

Db 556 KPEEEAVSSSQSPTATEIPGPLGSGLMPPLPFFNKQKAIDLLYWRDIKQTGIVFGSFLLL 615 

Qy 1014 LLSLTVFS I VS VTAYI ALALLS VTI SFRI YKGVI QAI QKSDEGHPFRAYLESEVAI SEEL 1073 

I Ml INN MMM II MINIM h II : I I : I I I I I I : I I I I h MM 

Db 616 LFSLTQFSWSWAYLALAALSATISFRI YKSVLQAVQKTDEGHPFKAYLELEITLSQEQ 675 

Qy 1074 VQKYSNSALGHWCTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGLTLLILALIS 1133 

MM- Ml IMIIIIMM Mill MINIM: 1 1| I II II 1 1 1 1 1 = M = = I 

Db 676 IQKYTDCLQLYVNSTLKELRRLFLVQDLVDSLKFAVLMWLLTYVGALFNGLTLLLMAWS 73 5 

Qy 1134 LFS VPVI YERHQAQ I DHYLGLANKNVKDAMAKI QAKI PGLKRKAE 117 8 

OX M:MIM MIMM3||^775X:475X$|||||||| || || 

Db 736 MFTLPWYVKHQAQVDQYLGLVRTHINTWAKIQAKIPGAKRHAE 78 0 



RESULT 14 
Q8CCU2 

ID Q8CCU2 PRELIMINARY; PRT; 643 AA. 

AC Q8CCU2; 

DT 01-MAR-2003 (TrEMBLrel . 23, Created) 
DT 01-MAR-2003 (TrEMBLrel. 23, Last sequence update) 
DT 01-MAR-2003 (TrEMBLrel. 23, Last annotation update) 
DE Reticulon 3. 



OS Mus musculus (Mouse) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Rodent ia; Sciurognathi ; Muridae; Murinae; Mus. 

OX NCBI_TaxID=10090; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=C57BL/6J; TISSUE-Medulla oblongata; 

RX MEDLINE=22354683; PubMed-12466851 ; 

RA The FANTOM Consortium, 

RA the RIKEN Genome Exploration Research Group Phase I & II Team; 

RT "Analysis of the mouse transcriptome based on functional annotation of 

RT 60,770 full-length cDNAs . " ; 

RL Nature 420:563-573(2002). 

DR EMBL; AK032109; BAC27708.1; -. 

SQ SEQUENCE 643 AA; 68769 MW; 15E3335B280533F8 CRC64 ; 

Query Match 11.8%; Score 699; DB 11; Length 643; 

Best Local Similarity 31.9%; Pred. No. 5.6e-27; 

Matches 201; Conservative 95; Mismatches 186; Indels 148; Gaps 20; 

Qy 591 ATPSPVLPDIVMEAPLNSAVPSAGASVIQPSSSPLEASSVNYESIKHEPENPPPYEEAMS 650 

M Ih |||:: : I h | | :|:: | | 
Db 120 ATVEVTLPN LRGAWPNSVMGEVTEVDSSGESDDTVIEDITEKPDSLP S 167 

QY 651 VSLKVSGIKEEIKEPENINAALQETEAPYISIACDLIKETKLSAEPAPDFSDYSEMAKVE 710 

: I I : I I I I = :|| 

Db 168 AAAKTS - - ERE I KETPS RETV 186 

QY 711 QPVPDHSELVEDS - SPDSEPVDLFSDDS I PDVPQKQDETVMLVKESLTETSFESMI EYEN 769 

I I : I : I I : : I I : I I =1 I : I : I 

Db 187 RSEMCENSEQPQAQP ETPTQKS LEG EVAS QVPNTLNEVTPEKLDMTNN 234 

Qy 770 KEKLSALPP EGG KPYLESFKLSLDNTKDTLLPDEVSTL- -SKKEKI PLQ 816 

= M II M III I |::= : :|| = 

Db 235 PKVCSAAPPSVLNETGFSLTVPASAKLESLLGKYVEDTDGSSPEDLMAVLTGAEEKGIVD 294 

Qy 817 MEELSTAVYSNDDLFI SKEAQI RETETFSDSSPI EI I DEFPTLI SSKT DSFSKLAR 872 

M M M : : | :: |:|:: | | |:| :|: :| 

°b 295 KEE GDVL EAVLEKIADFKNTLPVELLHE-SELSGSETKNIKSKYSEDSR 342 

Qy 873 EYTDLEVSHKSEIANAPDGAGSLPCTELPHDLSLKNIQPKVEEKI 917 

II II : I :| -I : : Ih 

Db 343 ETT GGAPTMSPDLEQEQLT- IRAIKELGERQAEKVQDEGISSGGKLKQTF 3 91 

Qy 918 S FS DDF S KNG SATS KVLLLP PDVSALGHTQAE - 1 ES I VKPKVTjEKEAEKKLPS 969 

Ml I | =| : : : :|: : | |: | ::: | :| | 

Db 392 APQSGPQSSSDILEHTDVKTGSDLGIPKNPTIIKNTRIDSISSLTKTEMVNKNVLARLLS 451 

Qy 970 DTEKEDRSPSAIFSADLGKTSWDLLYWRDIKKTGWFGASLFLLLSLTVFSIVSVTAYI 1029 



Db 



452 D 




Qy 




Db 



Qy 



1090 KELRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGLTLLI LALI SLFS VPVI YERHQAQI D 114 9 



I : MIIMIIIMI II lh Mllhllh II MM : MllhMh- III 

Db 554 KLI I RLFLVEDLVDSLKIAVFMWLMTYVGAVFNGI TLLI IAELLVFS VPI VYEKYKTQI D 613 

Qy 1150 HYLGLANKNVKDAMAKIQAKIPGL-KRKAE 1178 

Ihh | : MMhlh hill 

Db 614 HYVGIARDQTKSIVEKIQAKLPGIAKKKAE 643 



RESULT 15 
Q9BQ59 



ID Q9BQ59 PRELIMINARY; PRT; 199 AA. 

AC Q9BQ59; 

DT 01-JUN-2001 (TrEMBLrel . 17, Created) 

DT 01-JUN-2001 (TrEMBLrel . 17, Last sequence update) 

DT 01-MAR-2003 (TrEMBLrel. 23, Last annotation update) 

DE Similar to reticulon 1. 

OS Homo sapiens (Human) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

OX NCBI_TaxID=9606 ; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC TISSUE^Lung; 

RA Strausberg R.; 

RL Submitted (FEB-2001) to the EMBL/ GenBank/DDBJ databases. 

DR EMBL; BC003003; AAH03003.1; -. 

DR EMBL; BC000314; AAH00314.1; 

DR InterPro; IPR001951; Histone_H4 . 

DR InterPro; IPR003388; Reticulon. 

DR Pfam; PF02453; Reticulon; 1. 

DR PROSITE; PS00047; HISTONE_H4; 1. 

DR PROSITE; PS5 0845; RETICULON; 1. 

SQ SEQUENCE 199 AA; 22642 MW; 7CFA44CC568DF6D8 CRC64; 



Query Match 11.6%; Score 688; DB 4; Length 19 9; 

Best Local Similarity 67.5%; Pred. No. 4e-27; 

Matches 129; Conservative 32; Mismatches 30; Indels 0; Gaps 0; 



Qy 


988 


KTSVVI)LLYWRDIKKTGWFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVI 


1047 


Db 


9 


h 1 M 1 ' 1 II III Ihlll II M II II III h 

KSQAIDLLYWRDIKQTGI VFGSFLLLLFSLTQFSWSWAYLALAALSATISFRIYKSVL 


68 


Qy 


1048 


QAIQKSDEGHPFRAYLESEVAI SEELVQKYSNSALGHVNCTI KELRRLFLVDDLVDSLKF 


1107 


Db 


69 


M M 1 M 1 II 1 1 M 1 1 1 h MM : ! Ml IMIIIIIIII llllllll 

QAVQKTDEGH PFKAYLELE I TLSQEQ I QKYTDCLQF YVNSTLKELRRLFLVQDLVDSLKF 


128 


Qy 


1108 


AVLMWVFT YVGALFNGLTLL I LAL I SLFS VPVI YERHQAQ I DHYLGLANKNVKDAMAKI Q 


1167 


Db 


129 


MMh IMIIIIIIIII|::|::|:|::|hl :|||||| INI = = HIM 
AVLMWLLTYVGALFNGLTLLLMAWSMFTLPWYVKHQAQI DQ YLGLVRTH I NAWAKI Q 


188 


Qy 


1168 


AKI PGLKRKAE 1178 




Db 


189 


Mill 1 1 II 

AKI PGAKRHAE 199 





Search completed: January 22, 2004, 16:34:12 
Job time : 68.624 sees 



GenCore version 5.1.6 
Copyright (c) 1993 - 2004 Compugen Ltd. 



OM protein - protein search, using sw model 



Run on: 



Title: 

Perfect score: 
Sequence : 

Scoring table: 



January 22, 2004, 16:34:19 ; Search time 58.0516 Seconds 

(without alignments) 
4195.163 Million cell updates/sec 

US-09-830-972-29 
5923 

1 MEDLDQSPLVSSSDSPPRPQ VKDAMAKI QAKI PGLKRKAE 1178 

BLOSUM62 

Gapop 10.0 , Gapext 0 . 5 



Searched : 



777136 seqs, 206736638 residues 



Total number of hits satisfying chosen parameters: 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 



777136 



Post -process ing : 



Minimum Match 0% 
Maximum Match 100% 
Listing first 45 summaries 



Database : Published_Applications__AA: * 

1 : /cgn2_6/ptodata/l/pubpaa/US07_PUBCOMB.pep: * 

2 : /cgn2_6/ptodata/l/pubpaa/PCT__NEW_PUB.pep: * 

3 : /cgn2_6/ptodata/l/pubpaa/US06_NEW_PUB.pep 

4 : /cgn2_6/ptodata/l/pubpaa/US06_PUBCOMB.pep 

5 : /cgn2_6/ptodata/l/pubpaa/US07JNfEW_PUB.pep 

6 : /cgn2_6/ptodata/l/pubpaa/PCTUS_PUBC0MB.pep: 

7 : /cgn2_6/ptodata/l/pubpaa/US08_NEW_PUB .pep : * 

8 : /cgn2_6/ptodata/l/pubpaa/US08_PUBCOMB.pep: * 

9 : /cgn2_6 /pt oda t a / 1 /pubpaa/US 0 9A_PUBCOMB . pep : 

10 : /cgn2_6/ptodata/l/pubpaa/US09B_PUBCOMB.pep 

11 : /cgn2_6/ptodata/l/pubpaa/US09C_PUBCOMB.pep 

12 : /cgn2_6/ptodata/l/pubpaa/US09_NEW__PUB.pep: 

13 : /cgn2_6/ptodata/l/pubpaa/US10A_PUBCOMB.pep 

14 : /cgn2_6/ptodata/l/pubpaa/US10B_PUBCOMB.pep 

15 : /cgn2_6/ptodata/l/pubpaa/US10C__PUBCOMB.pep 

16 : /cgn2_6/ptodata/l/pubpaa/US10_NEW_PUB.pep 

17 : /cgn2_6/ptodata/l/pubpaa/US60_NEW_PUB.pep 

18 : /cgn2_6/ptodata/l/pubpaa/US60_PUBCOMB.pep 

Pred. No. is the number of results predicted by chance to have a 

score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 
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ALIGNMENTS 



RESULT 1 
US-09-758-140-6 

; Sequence 6, Application US/09758140 
; Patent No. US20020012965A1 
; GENERAL INFORMATION : 

; APPLICANT: Strit tmat ter , Stephen M . 



; TITLE OF INVENTION: No. US2 0020012965Alo Receptor-Mediated Blockade of 
Axonal Growth 

FILE REFERENCE: 44574 -5073 -US 
CURRENT APPLICATION NUMBER: US/ 09/758 , 14 0 
CURRENT FILING DATE: 2001-01-12 
PRIOR APPLICATION NUMBER: US 60/175,707 
PRIOR FILING DATE: 2000-01-12 
PRIOR APPLICATION NUMBER: US 60/207,366 
PRIOR FILING DATE: 2000-05-26 
PRIOR APPLICATION NUMBER: US 60/236,378 
PRIOR FILING DATE: 2000-09-29 
NUMBER OF SEQ ID NOS : 2 0 
SOFTWARE: Patentln Ver. 2.1 
SEQ ID NO 6 
LENGTH : 1192 
TYPE : PRT 

ORGANISM: Homo sapiens 
US-09-758-140-6 

Query Match 98.2%; Score 5815; DB 9; Length 1192; 

Best Local Similarity 97.2%; Pred. No. 3.5e-284; 

Matches 1160; Conservative 4; Mismatches 13; Indels 16; Gaps 3; 
QY 1 MEDLDQSPLVSSSDSPPRPQPAFKYQFVREPEDEEEEEEEEE EDEDLEELEVLERKPA 60 8625XG 

IMIIMIIIIIIIIIIIMIIIIIIIIIIIIIIIIMIIMIIMIMMIIIIMIII 

Db 1 MEDLDQSPLVSSSDSPPRPQPAFKYQFVREPEDEEEEEEEEEEDEDEDLEELEVLERKPA 60 

Qy 61 AGLSAAPVPTAPAAGAPLMDFGNDFVPPAPRGPLPAAPPVAPERQPSWDPSPVSSTVPAP 12 0 

' : ! i i i i i ! i i i i i , i mi i i i n m 

Db 61 AGLSAAPVPTAPAAGAPLMDFGNDFVPPAPRGPLPAAPPVAPERQPSWDPSPVSSTVPAP 12 0 

Qy 121 SPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPVWTPPAPAPAAPPSTPAAPKRRG 18 0 

IIIIIIIIIIIMIIIIIIIIIIIIIMIIIMIIIMIIIIIIIIIIIIIIIIIIMII 

Db 121 SPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPVWTPPAPAPAAPPSTPAAPKRRG 18 0 

Qy 181 SSGA WXXXXKIMDLKEQPGNTISAGQEDFPSVLLETAASXPSLSP 226 

^ . h = I I I I I I I I I I I I I I I I I I I I I I I I I I I I Mill 

Db 181 S SGS VDETLFALPAAS E P VI RS SAENMDLKEQPGNT I SAGQEDFPSVLLETAASLPSLS P 24 0 

QY 22 7 LSAASFKEHEYLGNLSTVLPT GTLQENVS EAS KE VS EKAKTLL I DRDLTEF S ELE YS EM 2 86 

MMIMIIIIIIIIIMMIIIIIMIIIIIMIIIIIMIIMIIMIIIIMIIIII 

Db 241 LSAASFKEHEYLGNLSTVLPTEGTLQENVS EAS KEVS EKAKTLL I DRDLTEFS ELE YS EM 3 00 

Qy 287 GSS FS VS PKAESAVI VANPREE 1 1 VKNKDEEEKLVSNN I LHXQQELPTALTKLVKEDE W 346 

MIMIIMIIMIIIIIIIIIMIIIIIIIIIMIMIII 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 

Db 301 GSSFSVSPKAESAVI VANPREEI I VKNKDEEEKLVSNN I LHNQQELPTALTKLVKEDEW 360 

Qy 34 7 SSEKAKDSFNEKRVAVEAPMREEYADFKPFERVWEVKDSKEDSDMLAAGGKIESNLESKV 4 06 

1 1 f 1 1 1 1 1 1 1 1 1 1 J 1 1 J 1 1 r 1 1 1 1 f 1 1 f j i j 1 1 1 1 r 1 1 1 j j 1 1 1 j 1 1 j 1 1 1 j j 1 1 r r i j i 

Db 361 SSEKAKDSFNEKRVAVEAPMREEYADFKPFERVWEVKDSKEDSDMLAAGGKIESNLESKV 420 

Qy 407 DKKCFADSLEQTNHEKDSESSNDDTSF PSTPEG I KDRSGA Y I TCAPFNPAATES I ATN I F 466 

MIIIIIIIIMIIIIIIIIIMIIIIIIIIIIMIIIMIIIIIIIIIIIIIIIIMII 

Db 421 DKKCFADSLEQTNHEKDSESSNDDTSF PSTPEG I KDRSGAYI TCAPFNPAATES I ATNIF 480 

Qy 4 QP 0 PLLEDPTS ENXTDEKKI EEKKAQ I VTEKNTSTKTSNPFFVAAQDSETDY VTTDNLTKVTE 526 

Ml MMII 1 1 ) 1 1 1 1 1 1 1 1 1 1 f 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 f 1 1 1 



Db 


481 


PLLGDPTSENKTDEKKI EEKKAQI VTEKNTSTKTSNPFLVAAQDSETDYVTTDNLTKVTE 


540 


Qy 


527 


EWANMPEGLTPDLVQEACESELNEVTGTKIAYETKMDLVQTSEVMQESLYPAAQLCPSF 
1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 


586 


Db 


541 


EWANMPEGLTPDLVQEACESELNEVTGTKIAYETKMDLVQTSEVMQESLYPAAQLCPSF 


600 


Qy 


587 


EESEATPSPVLPDIVMEAPLNSAVPSAGASVIQPSSSPLEASSVNYESIKHEPENPPPYE 

1 1 1 1 M 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

EESEATPSPVLPDIVMEAPLNSAVPSAGASVIQPSSSPLEASSVNYESIKHEPENPPPYE 


646 


Db 


601 


660 


Qy 


647 


EAMSVSL-KVSGIKEEIKEPENINAALQETEAPYISIACDLIKETKLSAEPAPDFSDYSE 


705 


Db 


661 


1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 I I I I || 1 1 II 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 II 1 1 II 1 1 
EAMSVSLKKVSGIKEEIKEPENINAALQETEAPYISIACDLIKETKLSAEPAPDFSDYSE 


720 


Qy 


706 


MAKVEQPVPDHSELVEDSSPDSEPVDLFSDDSIPDVPQKQDETVMLVKESLTETSFESMI 


765 


Db 


721 


1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 I 1 1 1 1 1 1 I I 

MAKVEQPVPDHSELVEDSSPDSEPVDLFSDDSIPDVPQKQDETVMLVKESLTETSFESMI 


780 


Qy 


766 


EYENKEKLSALPPEGGKPYLESFKLSLDNTKDTLLPDEVSTLSKKEKIPLQMEELSTAVY 


825 


Db 


781 


IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIMIIMIIIIIIII 

EYENKEKLSALPPEGGKPYLESFKLSLDNTKDTLLPDEVSTLSKKEKI plqmeelstavy 


840 


Qy 


826 


SNDDLF I S KEAQI RETETFSDSS P I EI I DEFPTLI SSKTDSFSKLARE YTDLEVSHKS E I 
1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 


885 


Db 


841 


1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 I I I I M I I | | M | | | | | | | | | | | | | | | | | 

SNDDLFI SKEAQI RETETFSDSSPI EI IDEFPTLI SSKTDSFSKLAREYTDLEVSHKSEI 


900 


Qy 


886 


ANAPDGAGSLPCTELPHDLSLKNI QPKVEEKI SFSDDFSKNGSATSKVLLLPPDVSALGH 

IIIIIIIIMIIIIIIMIIIIIIIIIIIIIIIIIIIMMIIIIIIIIIIIIIIIII 

ANAPDGAGSLPCTELPHDLSLKNI QPKVEEKI SFSDDFSKNGSATSKVLLLPPDVSALA- 


945 


Db 


901 


959 


Qy 


946 


TQAEIESIVKPKVLEKEAEKKLPSDTEKEDRSPSAIFSADLGKTSWDLLYWRDIKKTGV 


1005 


Db 


960 


| | | | | | | | | | | | | | | | | | || | | | | | | | | | | | | | | | | | | : | | | | || | || | | | | | | | | || 

TQAEI ES I VKPKVLVKEAEKKLPSDTEKEDRS PSA I FSAELSKTSWDLLYWRDI KKTGV 


1019 


Qy 


1006 


VFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRI YKGVIQAIQKSDEGHPFRAYLES 

1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 | || | | | | | | | | | | || | | | 

VFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRI YKGVIQAIQKSDEGHPFRAYLES 


1065 


Db 


1020 


1079 


Qy 


1066 


EVAI SEELVQKYSNSALGHVNCTI KELRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGLT 


1125 


Db 


1080 


IIIIIIIIIMIIIIIIIIIIIIMIIMIIIIIIIIIIIIIIMIIIIIMIIIIMII 

EVAI SEELVQKYSNSALGHVNCTI KELRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGLT 


1139 


Qy 


1126 


LLI LAL I SLFS VPVI YERHQAQI DHYLGLANKNVKDAMAKI QAKI PGLKRKAE 1178 




Db 


1140 


IIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIMIIIIIIIIIIIIIIII 

LLI LAL I SLFSVPVI YERHQAQI DHYLGLANKNVKDAMAKI QAKI PGLKRKAE 1192 





RESULT 2 

US-09-972-599A-6 

; Sequence 6, Application US/09972599A 
; Patent No. US20020077295A1 
; GENERAL INFORMATION: 

; APPLICANT: STRITTMATTER , STEPHEN M . 

; TITLE OF INVENTION: NOGO- RECEPTOR-MEDIATED BLOCKADE OF AXONAL GROWTH 
; FILE REFERENCE: C077 CIP US 

; CURRENT APPLICATION NUMBER: US/09/972 , 599A 
CURRENT FILING DATE: 2001-10-06 



PRIOR APPLICATION NUMBER: PCT/USOl/01041 

PRIOR FILING DATE: 2001-01-12 

PRIOR APPLICATION NUMBER : 09/758,140 

PRIOR FILING DATE: 2001-01-12 
; PRIOR APPLICATION NUMBER: 60/236,378 

PRIOR FILING DATE: 2000-09-29 

PRIOR APPLICATION NUMBER : 60/2 07,366 
; PRIOR FILING DATE: 2000-05-26 

PRIOR APPLICATION NUMBER: 60/175,707 . 

PRIOR FILING DATE: 2000-01-12 
; NUMBER OF SEQ ID NOS : 57 
; SOFTWARE: Patentln Ver. 2.1 
; SEQ ID NO 6 

LENGTH: 1192 
TYPE : PRT 

ORGANISM: Homo sapiens 
US-09-972-599A-6 

Query Match 98.2%; Score 5815; DB 9; Length 1192; 

Best Local Similarity 97.2%; Pred. No. 3.5e-284; 

Matches 1160; Conservative 4; Mismatches 13; Indels 16; Gaps 3; 



Qy 1 MEDLDQSPLVSSSDSPPRPQPAFKYQFVREPEDEEEEEEEEEEDEDEDLEELEVLERKPA 6 0 

iMMIMIIMIIIIi IIIIIIIIIMIIIIIIIIIIIIIMIIIIII IIIIIIMM 

Db 1 MEDLDQSPLVSSSDSPPRPQPAFKYQFVREPEDEEEEEEEEEEDEDEDLEELEVLERKPA 6 0 

QY 61 AGLSAAPVPTAPAAGAPLMDFGNDFVPPAPRGPLPAAPPVAPERQPSWDPSPVSSTVPAP 12 0 

IIIIIIIIIIIIIIIIMIIIIIIIMMIIIIIIIMIIIIIIIIIIIIIIIIIIIIII 

Db 61 AGLSAAPVPTAPAAGAPLMDFGNDFVPPAPRGPLPAAPPVAPERQPSWDPSPVSSTVPAP 120 

Qy 121 SPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPVWTPPAPAPAAPPSTPAAPKRRG 18 0 

IIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIII 

Db 121 SPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPVWTPPAPAPAAPPSTPAAPKRRG 18 0 

Qy 181 SSGA - -WXXXXKIMDLKEQPGNTISAGQEDFPSVLLETAASXPSLSP 226 

Mh h : I I M I I I I I I I I I I I I I I I I I I I | | | | | MM 

Db 181 SSGSVDETLFALPAASEPVIRSSAENMDLKEQPGNTISAGQEDFPSVLLETAASLPSLSP 24 0 

Qy 227 LSAASFKEHEYLGNLSTVLPTEGTLQENVSEASKEVSEKAKTLLIDRDLTEFSELEYSEM 286 

I i I i 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 ! 1 1 1 1 ! E f 1 1 1 ! 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 ! 1 1 1 1 1 

Db 241 LSAASFKEHEYLGNLSTVLPTEGTLQENVSEASKEVSEKAKTLLIDRDLTEFSELEYSEM 3 00 

Qy 287 GSSFSVSPKAESAVIVANPREEIIVKNKDEEEKLVSNNILHXQQELPTALTKLVKEDEW 346 

1 1 1 J 1 1 1 1 1 1 1 J 1 1 1 1 1 1 1 ! 1 1 1 1 f I i 1 1 1 1 1 j 1 1 j i 1 1 j f 

Db 301 GSSFSVSPKAESAVIVANPREEI I VKNKDEEEKLVSNNILHNQQELPTALTKLVKEDEW 360 

Qy 347 SSEKAKDSFNEKRVAVEAPMREEYADFKPFERVWEVKDSKEDSDMLAAGGKIESNLESKV 406 

MIIMIIIIIMIIMIIIIIIIIIMMIMIIMIMIIIIIIIIIIIIIMIIIII 

Db 361 SSEKAKDSFNEKRVAVEAPMREEYADFKPFERVWEVKDSKEDSDMLAAGGKIESNLESKV 420 

Qy 4 07 DKKCFADSLEQTNHEKDSESSNDDTSFPSTPEGIKDRSGAYITCAPFNPAATESIATNIF 4 66 

1 1 1 1 1 f i r 1 1 1 1 i i r i f 1 1 1 1 r 1 1 1 r 1 1 1 f j j 1 1 1 1 1 1 r 1 1 r 1 1 1 j 1 1 j 1 1 1 j 1 1 1 r 1 1 1 

Db 421 DKKCFADSLEQTNHEKDSESSNDDTSFPSTPEGIKDRSGAYITCAPFNPAATESIATNIF 480 

Qy 467 PLLEDPTSENXTDEKKIEEKKAQIVTEKNTSTKTSNPFFVAAQDSETDYVTTDNLTKVTE 526 

III MINI IIIIIIIIIIIIIMMIIMIIIMI ; i M 

Db 481 PLLGDPTSENKTDEKKI EEKKAQI VTEKNTSTKTSNPFLVAAQDSETDYVTTDNLTKVTE 540 



Qy 


527 


EWANMPEGLTPDLVQEACESELNEVTGTKIAYETKMDLVQTSEVMQESLYPAAQLCPSF 


586 


Db 


541 


EWANMPEGLTPDLVQEACESELNEOTGTKIAYETKMDLVQTSEVMQESLYPAAQLCPSF 


600 


Qy 


587 


EESEATPSPVLPDIVMEAPLNSAVPSAGASVIQPSSSPLEASSWYESIKHEPENPPPYE 


646 


Db 


601 


I MMI 1 1 1 1 Ml 1 U 1 1 1 1 1 M M M! 1 I M 

EESEATPSPVLPDIVMEAPLNSAVPSAGASVIQPSSSPLEASSVNYESIKHEPENPPPYE 


660 


Qy 


647 


EAMSVSL-KVSGIKEEIKEPENINAALQETEAPYISIACDLIKETKLSAEPAPDFSDYSE 


705 


Db 


661 


Mlllll M II . II M MM II 1 1 1 > 1 1 1 1 1 Ml M Ml M . M 1 1 1 1 1 1 M 1 II II 

EAMSVSLKKVSGI KEEI KEPENINAALQETEAPYI SI ACDLI KETKLSAE PAPDFSDYSE 


720 


Qy 


706 


MAKVEQPVPDHSELVEDSSPDSEPVDLFSDDSIPDVPQKQDETVMLVKESLTETSFESMI 


765 


Db 


721 


1 II M II M IIIIMIIIIIIIIIIIIIIIIIIMMlll MM llllll llllllll II 

MAKVEQPVPDHSELVEDSSPDSEPVDLFSDDS I PDVPQKQDETVMLVKESLTETSFESMI 


780 


Qy 


766 


EYENKEKLSALPPEGGKPYLESFKLSLDNTKDTLLPDEVSTLSKKEKIPLQMEELSTAVY 


825 


Db 


781 


1 1 1 1 1 M 1 1 Ml 1 i Ml 1 Ml !M 1 1 MM 

EYENKEKLSALPPEGGKPYLESFKLSLDNTKDTLLPDEVSTLSKKEKIPLQMEELSTAVY 


840 


Qy 


826 


SNDDLFISKEAQIRETETFSDSSPIEI IDEFPTLISSKTDSFSKIiAREYTDLEVSHKSEI 


885 


Db 


841 


IIIIIIIIIIIMIIIIMIIIIIIIIIIIIIIIIIIIMMIIIMIIIIIIIIIIIII 

SNDDLFISKEAQIRETETFSDSSPIEI IDEFPTLISSKTDSFSKLAREYTDLEVSHKSEI 


900 


Qy 


886 


ANAPDGAGSLPCTELPHDLSLKNIQPKVEEKISFSDDFSKNGSATSKVLLLPPDVSALGH 

IIIIIIIIIIIIIMMMIIIIIIIIIIIIIIIMMIMIIIIIIIIIIIMIIII 


945 


Db 


901 


ANAPDGAGSLPCTELPHDLSLKNIQPKVEEKISFSDDFSKNGSATSKVLLLPPDVSALA- 


959 


Qy 


946 


TQAEI ES I VKPKVLEKEAEKKLPSDTEKEDRSPSAI FSADLGKTSWDLLYWRDI KKTGV 


1005 


Db 


960 


MMMMIMM MIMMMMMIIMM ' M 1 1 II 1 1 1 1 1 1 M II 1 1 1 

TQAEIESIVKPKVLVKEAEKKLPSDTEKEDRSPSAIFSAELSKTSWDLLYWRDIKKTGV 


1019 


Qy 


1006 


VFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRI YKGVIQAIQKSDEGHPFRAYLES 


1065 


UD 


1 A O A 


IIIIIIIIIIIMIMIMIIIIIIIIIIIMIMIMIIIIIIIIIIIMIIIIIIIII 

VFGASLFLLLSLTVFS I VSVTAY I ALALLSVT I SFR I YKGVIQAIQKSDEGHPFRAYLES 


1079 


Qy 


1066 


EVAI SEELVQKYSNSALGHVNCTI KELRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGLT 


1125 


Db 


1080 


M II II 1 MM M Ml M 1 MM 1 1 M 1 1 1 1 1 M 1 MM 1 1 II M 1 1 1 Ml M 1 1 

EVAI SEELVQKYSNSALGHVNCTI KELRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGLT 


1139 


Qy 


1126 


LLI LALI SLFSVPVI YERHQAQI DHYLGLANKNVKDAMAKI QAKI PGLKRKAE 1178 

MIIIIIIIIMIIIMIIIIIIIIMIIIIIIIIIMIMIIIIMIIIIII 




Db 


1140 


LL I LAL I SLFSVPVI YERHQAQ I DHYLGLANKNVKDAMAKI QAKI PGLKRKAE 1192 





RESULT 3 

US-10-060-036-71 

Sequence 71, Application US/10060036 
Publication No. US20030073144A1 
GENERAL INFORMATION: 



APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 



Benson, Darin R. 
Kalos, Michael D. 
Lodes, Michael J. 
Persing, David H . 
Hepler, William T. 
Jiang, Yuqiu 



TITLE OF INVENTION; COMPOSITIONS AND METHODS FOR THE THERAPY 
TITLE OF INVENTION: AND DIAGNOSIS OF PANCREATIC CANCER 
FILE REFERENCE: 210121.566 

CURRENT APPLICATION NUMBER: US/ 10/060 , 036 
CURRENT FILING DATE: 2002-01-30 
NUMBER OF SEQ ID NOS : 4560 
SOFTWARE: FastSEQ for Windows Version 4.0 
SEQ ID NO 71 
LENGTH: 1192 
TYPE : PRT 

ORGANISM: Homo sapiens 
US-10-060-036-71 

Query Match 98.2%; Score 5815; DB 15; Length 1192; 

Best Local Similarity 97.2%; Pred. No. 3.5e-284; 

Matches 1160; Conservative 4; Mismatches 13; Indels 16; Gaps 3; 
QY 1 MEDLDQSPLVSSSDSPPRPQPAFKYQFVREPEDEEEEEEEEEEDEDEDLEELEVLERKPA 60 

MMIMIIIIIIIIIMIIIMMIIIIIIMIIMIIIIIIIIIIIMIIIIIIIlll 

Db 1 MEDLDQSPLVSSSDSPPRPQPAFKYQFVREPEDEEEEEEEEEEDEDEDLEELEVLERKPA 60 

QY 61 AGLSAAPVPTAPAAGAPLMDFGNDFVPPAPRGPLPAAPPVAPERQPSWDPSPVSSTVPAP 12 0 

1 1 1 M 1 1 1 1 1 ! 1 1 1 1 1 1 1 M 1 1! If II M 1 1 1 1 M I M I M 1 1 1 M 1 1 1 1 M 1 1 1 1 ( 1 1 1 

Db 61 AGLSAAPVPTAPAAGAPLMDFGNDFVPPAPRGPLPAAPPVAPERQPSWDPSPVSSTVPAP 12 0 

Qy 121 SPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPVWTPPAPAPAAPPSTPAAPKRRG 18 0 

, 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

Db 121 SPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPVWTPPAPAPAAPPSTPAAPKRRG 180 

Qy 181 SSGA-- WXXXXKIMDLKEQPGNTISAGQEDFPSVLLETAASXPSLSP 226 

Ml: h : I I I I I M I I I I I I I I I I I I I I I I I I M I Mill 

Db 181 SSGSVDETLFALPAASEPVIRSSAENMDLKEQPGNTISAGQEDFPSVLLETAASLPSLSP 240 

Qy 227 LSAASFKEHEYLGNLSTVLPTEGTLQENVSEASKEVSEKAKTLLI DRDLTEFSELEYSEM 286 

1 1 1 1 1 M I I I I I I I II I I I I I I I I I | | | | | | | | | | | | | || | | | | | | | | | M | | || | | | | | 
Db 241 LSAASFKEHEYLGNLSTVLPTEGTLQENVSEASKEVSEKAKTLLI DRDLTEFSELEYSEM 3 00 

Qy 287 GSSFSVS PKAESAVI VANPREEI I VKNKDEEEKLVSNNI LHXQQELPTALTKLVKEDEW 346 

,m 'U'!'! 1 " Ml Ml IMIIIIIIIIIIIIIIIII llllllllllllllllll 

Db 3 01 GS S FS VS P KAESAVI VAN PRE E 1 1 VKNKDE EEKLVSNN I LHNQQELPTALTKL VKEDE W 360 

Qy 347 SSEKAKDSFNEKRVAVEAPMREEYADFKPFERVWEVKDSKEDSDMLAAGGKIESNLESKV 4 06 

I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I | I I I I I I I I I I I I I I I II I 
Db 361 SSEKAKDSFNEKRVAVEAPMREEYADFKPFERVWEVKDSKEDSDMLAAGGKIESNLESKV 420 

Qy 4 07 DKKCFADSLEQTNHEKDSESSNDDTSFPSTPEGIKDRSGAYITCAPFNPAATESIATNIF 4 66 

1 1 1 1 1 M II I I I I I I I I I I I I | | | | | | | | | | | | | | | | | || | | M | | || | | | || | | | | | || 
Db 421 DKKCFADSLEQTNHEKDSESSNDDTSFPSTPEGIKDRSGAYITCAPFNPAATESIATNIF 480 

Qy 4 67 PLLEDPTSENXTDEKKI EEKKAQI VTEKNTSTKTSNPFFVAAQDSETDYVTTDNLTKVTE 52 6 

IN MINI IIIMIIIIIMIIMIIIIIIIIIII IMIIIIIMIIMIIIIIII 

Db 481 PLLGDPTSENKTDEKKI EEKKAQI VTEKNTSTKTSNPFLVAAQDSETDYVTTDNLTKVTE 54 0 

Qy 527 EWANMPEGLTPDLVQEACESELNEVTGTKIAYETKMDLVQTSEVMQESLYPAAQLCPSF 586 

I M I II 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 

Db 541 EWANMPEGLTPDLVQEACESELNEVTGTKIAYETKMDLVQTSEVMQESLYPAAQLCPSF 600 



Qy 


587 


EESEATPSPVLPDIVMEAPLNSAVPSAGASVIQPSSSPLEASSVNYESIKHEPENPPPYE 

1 1 1 1 M 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 
EESEATPSPVLPDIVMEAPLNSAVPSAGASVIQPSSSPLEASSVNYESIKHEPENPPPYE 


646 


Db 


601 


660 


Qy 


647 


EAMSVS L - KVSG I KE E I KE P EN I NAALQ ETEAP Y I S I ACDL I KETKLSAE PAPDFSDYSE 


705 


Db 


661 


MM MMMMMM IMMMI MMMMMMMMMMMMIMM 

EAMS VSLKKVSGI KEEI KEPENI NAALQETEAPYI S I ACDLI KETKLSAEPAPDFSDYSE 


720 


Qy 


706 


MAKVEQPVPDHSELVEDSSPDSEPVDLFSDDSIPDVPQKQDETVMLVKESLTETSFESMI 


765 


Db 


721 


1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 I I I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

MAKVEQPVPDHSELVEDSSPDSEPVDLFSDDSIPDVPQKQDETVMLVKESLTETSFESMI 


780 


Qy 


766 


EYENKEKLSALPPEGGKPYLESFKLSLDNTKDTLLPDEVSTLSKKEKIPLQMEELSTAVY 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 li 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 II II 1 1 1 1 1 1 1 1 
EYENKEKLSALPPEGGKPYLESFKLSLDNTKDTLLPDEVSTLSKKEKI PLQMEELSTAVY 


825 


Db 


781 


840 


Qy 


826 


SNDDLFISKEAQIRETETFSDSSPIEI IDEFPTLISSKTDSFSKLAREYTDLEVSHKSEI 


885 


Db 


841 


1 1 M 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 II 1 1 1 II 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 I 1 I I 
SNDDLFISKEAQIRETETFSDSSPIEI IDEFPTLISSKTDSFSKLAREYTDLEVSHKSEI 


900 


Qy 


886 


ANAPDGAGSLPCTELPHDLSLKNIQPKVEEKISFSDDFSKNGSATSKVLLLPPDVSALGH 


945 


Db 


901 


1 1 1 1 1 1 1 1 II 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
ANAPDGAGSLPCTELPHDLSLKNIQPKVEEKISFSDDFSKNGSATSKVLLLPPDVSALA- 


959 


Qy 


946 


TQAEIESI VKPKVLEKEAEKKLPSDTEKEDRSPSAIFSADLGKTSVVDLLYWRDIKKTGV 


1005 


Db 


960 


| | | | | | | | | | | | | | | | | | | | I | | | | | | | | | | | | | I | I | : | | | | | | | | | | | | | | | | | | | 
TQAEIESIVKPKVLVKEAEKKLPSDTEKEDRSPSAIFSAELSKTSWDLLYWRDIKKTGV 


1019 


Qy 


1006 


VFGASLFLLLSLTVFS I VSVTAYI ALALLS VTI SFRI YKGVI QAI QKSDEGHPFRAYLES 


1065 


Db 


1020 


1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II I || || I I I I I I | | | | | | | | | | | | | | | | | 
VFGASLFLLLSLTVFS I VSVTAYI ALALLS VTI SFRI YKGVI QAI QKSDEGHPFRAYLES 


1079 


Qy 


1066 


EVAI SEELVQKYSNSALGHVNCTI KELRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGLT 


1125 


Db 


1080 


MMIMMMMMMMMIMMM IMMMM MMIIMI MMIMM 

EVAI SEELVQKYSNSALGHVNCTI KELRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGLT 


1139 


Qy 


1126 


LLILALI SLFS VP VI YERHQAQ I DH YLGLANKNVKDAMAKI QAKI PGLKRKAE 1178 




Db 


1140 


IIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIMIIIIIIIIIIIIIIIIM 

LL I LAL I SLFS VPVI YERHQAQ I DH YLGLANKNVKDAMAKI QAKI PGLKRKAE 1192 





RESULT 4 
US-09-789-386-2 

; Sequence 2, Application US/09789386 
; Patent No. US20020010324A1 
; GENERAL INFORMATION: 
; APPLICANT: MICHALOVICH, DAVID 
; APPLICANT: * PRINJHA, RABINDER KUMAR 
; TITLE OF INVENTION: NOVEL COMPOUNDS 
FILE REFERENCE: GP-30165-C1 

CURRENT APPLICATION NUMBER: US/0 9/78 9 , 3 8 6 
; CURRENT FILING DATE: 2001-02-21 

PRIOR APPLICATION NUMBER: U.K. 9916898.1 

PRIOR FILING DATE: 1999-07-19 
; PRIOR APPLICATION NUMBER: U.K. 9816024.5 

PRIOR FILING DATE: 1998-07-22 
; PRIOR APPLICATION NUMBER: US 09/359,208 



PRIOR FILING DATE: 1999-07-22 
; NUMBER OF SEQ ID NOS : 6 

SOFTWARE: FastSEQ for Windows Version 3.0 
; SEQ ID NO 2 

LENGTH: 1192 
TYPE : PRT 

ORGANISM: HOMO SAPIENS 
US-09-789-386-2 

Query Match 98.1%; Score 5810; DB 9; Length 1192; 

Best Local Similarity 97.2%; Pred. No. 6.2e-284; 

Matches 1159; Conservative 4; Mismatches 14; Indels 16; Gaps 3; 



Qy 1 MEDLDQSPLVSSSDSPPRPQPAFKYQFVREPEDEEEEEEEEEEDEDEDLEELEVLERKPA 60 

I I I I I I Ml I I I I II I ! Ml f i :,i ,| | 

Db 1 MEDLDQSPLVSSSDSPPRPQPAFKYQFWEPEDEEEEEEEEEEDEDEDLEELEVLERKPA 60 

Qy 61 AGLSAAPVPTAPAAGAPLMDFGNDFVPPAPRGPLPAAPPVAPERQPSWDPSPVSSTVPAP 120 

I M' I I, I I I I I I I I I h i ; I I I ! I I I | 

Db 61 AGLSAAPVPTAPAAGAPLMDFGNDFVPPAPRGPLPAAPPVAPERQPSWDPSPVSSTVPAP 120 

Qy 121 SPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPVWTPPAPAPAAPPSTPAAPKRRG 18 0 

IIIIIIMIIIIIIIIMIIIIIMIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIII 

Db 121 SPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPVWTPPAPAPAAPPSTPAAPKRRG 18 0 
QY 181 SSGA WXXXXKI MDLKEQPGNTI SAGQEDFPSVLLETAASXPSLS P 226 

Mh h ^ Mil MliM Mlllll III Mill Mill 

Db 181 SSGSVDETLFALPAASEPVIRSSAENMDLKEQPGNTISAGQEDFPSVLLETAASLPSLSP 240 

QY 227 LSAASFKEHEYLGNLSTVLPTEGTLQENVSEASKEVSEKAKTLLIDRDLTEFSELEYSEM 28 6 

. ■ MM 

Db 241 LSAASFKEHEYLGNLSTVLPTEGTLQENVSEASKEVSEKAKTLLIDRDLTEFSELEYSEM 300 

Qy 2 87 GSSFSVSPKAESAVIVANPREEIIVKNKDEEEKLVSNNILHXQQELPTALTKLVKEDEW 34 6 

IIMIMIIIIIIMIIMMIIIIIMIIIIMIIIIIII II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

Db 301 GSSFSVSPKAESAVIVANPREEIIVKNKDEEEKLVSNNILHNQQELPTALTKLVKEDEW 360 

Qy 347 SSEKAKDSFNEKRVAVEAPMREEYADFKPFERVWEVKDSKEDSDMLAAGGKIESNLESKV 4 06 

I I I I I I I I I I I I I';' I Ml I I M ; I ,,| I 

Db 361 SSEKAKDSFNEKRVAVEAPMREEYADFKPFERVWEVKDSKEDSDMLAAGGKIESNLESKV 420 

Qy 407 DKKCFADSLEQTNHEKDSESSNDDTSFPSTPEGI KDRSGAYI TCAPFNPAATES IATNI F 466 

M MIMM IIMMI MMMMIMIMM I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 

Db 421 DKKCFADSLEQTNHEKDSESSNDDTSFPSTPEG I KDRPGAYI TCAPFNPAATES I ATN IF 480 

Qy 467 PLLEDPTSENXTDEKKIEEKKAQIVTEKNTSTKTSNPFFVAAQDSETDYVTTDNLTKVTE 526 

mi linn iiiiiiiiiiiiiiiiiiiiiiiiiii iiiiiiiiiiiiiiiiiiiii 

Db 481 PLLGDPTSENKTDEKKIEEKKAQIVTEKNTSTKTSNPFLVAAQDSETDYVTTDNLTKVTE 54 0 

Qy 527 EWANMPEGLTPDLVQEACESELNEVTGTKIAYETKMDLVQTSEVMQESLYPAAQLCPSF 586 

IMMMIIIMIMIIIIIIIIIMIMIMIMIIIIIIIIMIIIIIIIIIIIMM 

Db 541 EWANMPEGLTPDLVQEACESELNEVTGTKIAYETKMDLVQTSEVMQESLYPAAQLCPSF 600 

Qy 587 EESEATPSPVLPDIVMEAPLNSAVPSAGASVIQPSSSPLEASSVNYESIKHEPENPPPYE 646 

: I 1 1 1 1 ! 1 1 ' i I 1 1 1 1 1 1 1 1 i 1 1 ■ I ! 1 1 1 1 1 1 1 MM 1 1 1 

Db 601 EESEATPSPVLPDIVMEAPLNSAVPSAGASVIQPSSSPLEASSVNYESIKHEPENPPPYE 660 



Qy 


647 


EAMSVSL- KVSG I KEEI KEPENINAALQETEAPYI SIACDLI KETKLSAEPAPDFSDYSE 


705 


Db 


661 


1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 II II 1 1 1 1 1 1 1 1 1 II 1 1 I || I I I I I I I I I I I I 
EAMSVSLKKVSGI KEEI KEPENINAALQETEAPYI SIACDLI KETKLSAEPAPDFSDYSE 


720 


Qy 


706 


MAKVEQPVPDHSELVEDSSPDSEPVDLFSDDSI PDVPQKQDETVMLVKESLTETSFESMI 
1 1 1 M 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 

1 1 1 1 1 1 1 M 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 | 1 | | | | | | | | | | | | M | | | | | | | | | | | M 

MAKVEQPVPDHSELVEDSSPDSEPVDLFSDDSI PDVPQKQDETVMLVKESLTETSFESMI 


765 


Db 


721 


780 


Qy 


766 


EYENKEKLSALPPEGGKPYLESFKLSLDNTKDTLLPDEVSTLSKKEKIPLQMEELSTAVY 


825 


Db 


781 


1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
EYENKEKLSALPPEGGKPYLESFKLSLDNTKDTLLPDEVSTLSKKEKI PLQMEELSTAVY 


840 


Qy 


826 


SNDDLFISKEAQIRETETFSDSSPIEI IDEFPTLISSKTDSFSKLAREYTDLEVSHKSEI 


885 


Db 


841 


M 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 1 1 II 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 | | | | 

SNDDLFISKEAQIRETETFSDSSPIEI I DEFPTLISSKTDSFSKLAREYTDLEVSHKSEI 


900 


Qy 


886 


ANAPDGAGSLPCTELPHDLSLKNIQPKVEEKISFSDDFSKNGSATSKVLLLPPDVSALGH 
1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 I I I 


945 


Db 


901 


ANAPDGAGSLPCTELPHDLSLKNI QPKVEEKI SFSDDFSKNGSATSKVLLLPPDVSALA- 


959 


Qy 


946 


TQAEI ESI VKPKVLEKEAEKKLPSDTEKEDRSPSAI FSADLGKTSWDLLYWRDI KKTGV 


1005 


Db 


960 


1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 II 1 1 1 h 1 II 1 1 1 1 1 II 1 1 M 1 1 1 1 1 
TQAEIESIVKPKVLVKEAEKKLPSDTEKEDRSPSAIFSAELSKTSWDLLYWRDI KKTGV 


1019 


Qy 


1006 


VFGASLFLLLSLTVFS I VSVTAYI ALALLS VTI SFRI YKGVI QAI QKSDEGHPFRAYLES 


1065 


Db 


1020 


1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 II 1 1 1 1 1 1 1 II 1 1 1 1 1 1 II 1 1 1 

VFGASLFLLLSLTVFS I VSVTAYI ALALLS VTI SFRI YKGVI QAI QKSDEGHPFRAYLES 


1079 


Qy 


1066 


EVAI SEELVQKYSNSALGHVNCTI KELRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGLT 


1125 


Db 


1080 


MM MINI MINIM IIIMIIilll INN IIIMIIIIhll 1 Mill 1! Ill 

EVAI SEELVQKYSNSALGHVNCTI KELRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGLT 


1139 


Qy 


1126 


LLILALI SLFS VPVI YERHQAQI DHYLGLANKNVKDAMAKI QAKI PGLKRKAE 1178 




Db 


1140 


IIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIMIIIIII 

LLI LALI SLFSVPVI YERHQAQI DHYLGLANKNVKDAMAKI QAKI PGLKRKAE 1192 




RESULT 
US-09-£ 


5 

193-348- 


-23 





; Sequence 23, Application US/09893348 
; Patent No. US20020072493A1 
; GENERAL INFORMATION: 

; APPLICANT: EI SENBACH- SCHWARTZ , Michal 
; APPLICANT: COHEN, I run R. 
; APPLICANT: BESERMAN, Pierre 
; APPLICANT: MOSONEGO, Alon 
; APPLICANT: MOALEM, Gila 

; TITLE OF INVENTION: ACTIVATED T-CELLS, NERVOUS SYSTEM - SPECI FIC ANTIGENS AND 
THEIR USES 

; FILE REFERENCE: EI S- SCHWARTZ =2A 

CURRENT APPLICATION NUMBER: US/ 0 9/8 93 , 34 8 

CURRENT FILING DATE: 2001-06-28 

PRIOR APPLICATION NUMBER: US 09/314,161 
; PRIOR FILING DATE: 1999-05-19 
; PRIOR APPLICATION NUMBER: US 09/218,277 

PRIOR FILING DATE: 1998-12-22 

PRIOR APPLICATION NUMBER: PCT/US98/14715 



PRIOR FILING DATE: 1998-07-21 
; PRIOR APPLICATION NUMBER: IL 124500 
; PRIOR FILING DATE: 1998-05-19 
; NUMBER OF SEQ ID NOS : 29 

SOFTWARE: Patentln version 3.1 
; SEQ ID NO 23 

LENGTH: 1192 
TYPE : PRT 

ORGANISM: Homo sapiens 
US-09-893-348-23 

Query Match 98.1%; Score 5810; DB 9; Length 1192; 

Best Local Similarity 97.2%; Pred. No. 6.2e-284; 

Matches 1159; Conservative 4; Mismatches 14; Indels 16; Gaps 3; 



Qy 1 MEDLDQSPLVSSSDSPPRPQPAFKYQFVREPEDEEEEEEEEEEDEDEDLEELEVLERKPA 60 

.1 1 1 1 1 1 1 1 1 1 1 i I II 1 1 1 1 II ! II 1 1 1 1 II J II I M : 1 1 ! 1 1 , M 1 1 III 1 1 1 

Db 1 MEDLDQSPLVSSSDSPPRPQPAFKYQFVREPEDEEEEEEEEEEDEDEDLEELEVLERKPA 60 

Qy 61 AGLSAAPVPTAPAAGAPLMDFGNDFVPPAPRGPLPAAPPVAPERQPSWDPSPVSSTVPAP 12 0 

MM I II MIMI MMM IMMMMMIMM MIMIMIIIM IMIIMM II 

Db 61 AGLSAAPVPTAPAAGAPLMDFGNDFVPPAPRGPLPAAPPVAPERQPSWDPSPVSSTVPAP 12 0 

Qy 121 SPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPVWTPPAPAPAAPPSTPAAPKRRG 18 0 

M 1 1 1 1 1 1 1 1 1 II ! 1 1 1 1 1 1 1 1 1 1 E ! 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ! 1 1 1 1 i 1 1 1 M 1 1 1 1 1 1 1 1 1 

Db 121 SPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPVWTPPAPAPAAPPSTPAAPKRRG 18 0 
Qy 181 SSGA WXXXXKIMDLKEQPGNTISAGQEDFPSVLLETAASXPSLSP 226 

II- h = MMMM IMIIIIMMII III Mill 

Db 181 SSGSVDETLFALPAASEPVI RSSAENMDLKEQPGNTI SAGQEDFPSVLLETAASLPSLSP 24 0 

Qy 227 LSAASFKEHEYLGNLSTVLPTEGTLQENVSEASKEVSEKAKTLLIDRDLTEFSELEYSEM 286 

Db 241 LSAASFKEHEYLGNLSTVXjPTEGTLQENVSEASKEVSEKAKTLLIDRDLTEFSELEYSEM 3 00 

Qy 287 GSSFSVSPKAESAVIVANPREEIIVKNKDEEEKLVSNNILHXQQELPTALTKLVKEDEW 346 

MIIIIIIIIIMIIIIIIIMIIMIIIIIIIMMIMI 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 

Db 3 01 GSSFSVSPKAESAVIVANPREEIIVKNKDEEEKLVSNNILHNQQELPTALTKLVKEDEW 360 

Qy 347 SSEKAKDSFNEKRVAVEAPMREEYADFKPFERVWEVKDSKEDSDMLAAGGKIESNLESKV 406 

M I II 1 1 !l 1 1 1 1 1 1 1 II h II I II 1 1 1 III 1 1 : 1 1 1 li 1 1 1 II 1 1 1 M 1 1 1 1 1 1 

Db 3 61 SSEKAKDSFNEKRVAVEAPMREEYADFKPFERVWEVKDSKEDSDMLAAGGKIESNLESKV 42 0 

Qy 4 07 DKKCFADSLEQTNHEKDSESSNDDTSFPSTPEGIKDRSGAYITCAPFNPAATESIATNIF 466 

1 1 II 1 1 M 1 1 MM III I. 1 1 1 Ml II III 1 1 . IMMIIMM MIIIMI 

Db 421 DKKCFADSLEQTNHEKDSESSNDDTSFPSTPEGIKDRPGAYITCAPFNPAATESIATNIF 48 0 

Qy 467 PLLEDPTSENXTDEKKIEEKKAQIVTEKNTSTKTSNPFFVAAQDSETDYVTTDNLTKVTE 526 

Ml MIMI 1 1 1 M 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 M 1 1 1 ! II M M II II 1 1 1 1 II II I II 

Db 481 PLLGDPTSENKTDEKKIEEKKAQIVTEKNTSTKTSNPFLVAAQDSETDYVTTDNLTKVTE 54 0 

Qy 527 EWANMPEGLTPDLVQEACESELNEVTGTKIAYETKMDLVQTSEVMQESLYPAAQLCPSF 586 

M M 1 1 1 1 1 I II 1 1 1 1 1 1 1 1 Mill III I f 1 1 1 1 r 1 1 1 1 1 1 1 1 1 MIMI Mill II 

Db 541 EWANMPEGLTPDLVQEACESELNEVTGTKIAYETKMDLVQTSEVMQESLYPAAQLCPSF 60 0 

Qy 587 EESEATPSPVLPDIVMEAPLNSAVPSAGASVIQPSSSPLEASSVNYESIKHEPENPPPYE 646 

1 1 1 1 1 1 1 1 1 1 1 1 1 ! 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 



Db 


601 


EESEATPSPVLPDIVMEAPLNSAVPSAGASVIQPSSSPLEASSVNYESIKHEPENPPPYE 


660 


Qy 


647 


EAMSVSL-KVSGIKEEIKEPENINAALQETEAPYISIACDLIKETKLSAEPAPDFSDYSE 

Mill 1 1 II 1 M II 1 III II MM M II Ml II II II ill MM 1 Ml II MM 

EAMS VSLKKVSG I KEE I KEPEN I NAALQETEAP Y I S I ACDL I KETKLSAE PAPDFSDYS E 


705 


Db 


661 


720 


Qy 


706 


MAKVEQPVPDHSELVEDSSPDSEPVDLFSDDSIPDVPQKQDETVMLVKESLTETSFESMI 

1 1 1 II 1 1 1 1 1 i 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 


765 


Db 


721 


MAKVEQPVPDHSELVEDSSPDSEPVDLFSDDSIPDVPQKQDETVMLVKESLTETSFESMI 


780 


Qy 


766 


EYENKEKLSALPPEGGKPYLESFKLSLDNTKDTLLPDEVSTLSKKEKIPLQMEELSTAVY 


825 


Db 


781 


1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 II 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 

EYENKEKLSALPPEGGKPYLESFKLSLDNTKDTLLPDEVSTLSKKEKI PLQMEELSTAVY 


840 


Qy 


826 


SNDDLFISKEAQIRETETFSDSSPIEI IDEFPTLISSKTDSFSKLAREYTDLEVSHKSEI 


885 


Db 


841 


1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 II II 1 

SNDDLFISKEAQIRETETFSDSSPIEI I DEFPTLI SSKTDSFSKLAREYTDLEVSHKSE I 


900 


Qy 


886 


ANAPDGAGSLPCTELPHDLSLKNIQPKVEEKISFSDDFSKNGSATSKVLLLPPDVSALGH 


945 


Db 


901 


M 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II I M 1 1 1 | | | | | | | | | | | | | | 

ANAPDGAGSLPCTELPHDLSLKNIQPKVEEKISFSDDFSKNGSATSKVLLLPPDVSALA- 


959 


Qy 


946 


TQAEI ESI VKPKVLEKEAEKKLPSDTEKEDRSPSAI FSADLGKTSWDLLYWRDI KKTGV 


1005 


Db 


960 


Illllillllllll 1 1 1 1 1 1 1 1 1 1 1 1 1 I || I I I I I I I h | 1 1 1 I I I I I I I | | M | | | | 
TQAEI ES I VKPKVLVKEAEKKLPSDTEKEDRSPSAI FSAELSKTSWDLLYWRDI KKTGV 


1019 


Qy 


1006 


VFGASLFLLLSLTVFSI VSVTAYIALALLSVTISFRI YKGVIQAIQKSDEGHPFRAYLES 


1065 


Db 


1020 


iM II II M II II M MM II lli M 1 M III II 1 IMMM II 1 M II II 1 II MM 

VFGASLFLLLSLTVFSI VSVTAYIALALLSVTISFRI YKGVIQAIQKSDEGHPFRAYLES 


1079 


Qy 


1066 


EVAI SEELVQKYSNSALGHVNCTI KELRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGLT 


1125 


Db 


1080 


l I M l i: < I : 1 I M I 1 i 1 

EVAI SEELVQKYSNSALGHVNCTI KELRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGLT 


1139 


Qy 


1126 


LL I LALI SLFS VP VI YERHQAQ I DHYLGLANKNVKDAMAKI QAKI PGLKRKAE 117 8 

1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1| 1 1| 1 1 11 1| 1 1 1 1 1 1 




Db 


1140 


LLI LALI SLFS VPVI YERHQAQ I DHYLGLANKNVKDAMAKI QAKI PGLKRKAE 1192 




RESULT 
US-09-£ 


6 

J93-348- 


18 





; Sequence 18, Application US/09893348 
; Patent No. US20020072493A1 
; GENERAL INFORMATION: 

; APPLICANT: EI SENBACH- SCHWARTZ , Michal 
; APPLICANT: COHEN, I run R. 
; APPLICANT: BESERMAN, Pierre 
; APPLICANT: MOSONEGO, Alon 
; APPLICANT: MOALEM, Gila 

; TITLE OF INVENTION: ACTIVATED T-CELLS, NERVOUS SYSTEM-SPECIFIC ANTIGENS AND 
THEIR USES 

FILE REFERENCE: EIS-SCHWARTZ=2A 

CURRENT APPLICATION NUMBER: US/ 09/8 93 , 34 8 

CURRENT FILING DATE: 2001-06-28 
; PRIOR APPLICATION NUMBER: US 09/314,161 

PRIOR FILING DATE: 1999-05-19 

PRIOR APPLICATION NUMBER: US 09/218,277 



PRIOR FILING DATE: 1998-12-22 

PRIOR APPLICATION NUMBER: PCT/US98 /14 715 
; PRIOR FILING DATE: 1998-07-21 

PRIOR APPLICATION NUMBER: IL 124500 
; PRIOR FILING DATE: 1998-05-19 
; NUMBER OF SEQ ID NOS : 2 9 

SOFTWARE: Patentln version 3.1 
; SEQ ID NO 18 

LENGTH: 1163 
TYPE : PRT 

ORGANISM: Rattus norvegicus 
US-09-893-348-18 

Query Match 72.5%; Score 4296.5; DB 9; Length 1163; 

Best Local Similarity 74.0%; Pred. No. 7.4e-208; 

Matches 885; Conservative 104; Mismatches 156; Indels 51; Gaps 19; 
Qy 1 MEDLDQSPLVSSS-DSPPRPQPAFKYQFVREPEDEE-EEEEEEEEDEDEDLEELEVLERK 58 

Mhlll Mill 1 1 1 1 1 1 MINIM MINI 1 1 M I M h M II 1 1 1 1 II II 1 1 

Db 1 MEDIDQSSLVSSSTDSPPRPPPAFKYQFVTEPEDEEDEEEEEDEEEDDEDLEELEVLERK 6 0 

Qy 5 9 PAAGLSAAPVPTAPAAGAPLMDFGNDFVPPAPRGPLPAAPPVAPERQPSWDPSPVSSTVP 118 

MINIM II III MINI M MIMMMIMM i I II : I 

Db 61 PAAGLSAAAVP - -PAAAAPLLDFSSDSVPPAPRGPLPAAPPAAPERQPSWERSPAA P 115 

Qy 119 APSPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPVWTPPAPAPAAPPSTPAAPKR 178 

Ml MM I I : I I I I I U || M I Ml I 

Db 116 APSLPPAAAVLPSKLPEDDEPPARPPPPPPAGASPLAE PAAPPSTPAAPKR 166 

Qy 179 RGSSG AWXXXXKIMDLKEQPGNTISAGQEDFPSVLLETAASXPSL 224 



Db 



167 RGSGSVDETLFALPAASEPVIPSSAEKIMDLMEQPGNTVSSGQEDFPSVLLETAASLPSL 22 




Qy 



225 SPLSAASFKEHEYLGNLSTVljPTEGTLQENVSEASKEVSEKAKTLLIDRDLTEFSELEYS 284 

M M M MINN Mi- Ml I MINIM 

227 SPLSTVSFKEHGYLGNLSAVSSSEGTI EETLNEASKELPERATNPFVNRDLAEFSELEYS 2 8 6 



Db 



Qy 




Db 



Qy 




Db 



Qy 



404 SKVDKKCFADSLEQTNHEKDSESSNDDTSFPSTPEGIKDRSGAYITCAPFNPAATESIAT 463 

MIMM Mill : I II I hi MIIMI Ml I MINI I Mill 

3 97 SKVDRKCLEDSLEQKSLGKDSEGRNEDASFPSTPEPVKDSSRAYITCASFT-SATESTTA 455 



Db 



Qy 




Db 



Qy 



524 VTEEWANMPEGLTPDLVQEACESELNEVTGTKIAYETKMDLVQTSEVMQESLYPAAQLC 583 



Db 




Qy 


584 


PSFEESEATPSPVLPDIVMEAPLNSAVPSAGASVIQPSSSPLEA-SSVNYESIKHEPENP 


642 


Db 


575 


MMIMMMMMMMMMM MMIMIMM Mill IMMM Mill 

PSFEEAEATPSPVLPDI VMEAPLNSLLPSAGAS VVQPS VS PLEAPPPVS YDS I KLEPENP 


634 


Qy 


643 


P P YEEAMS VSLKVSG I KEE I KEPENI NAALQETEAP Y I S I ACDLI KETKLSAE PAPDFSD 


702 


Db 


635 


II II II h h II 1 II II II h II h 1 II 1 II II 1 1 1 II 1 1 II 1 II 1 1 h II 1 h 

PPYEEAMNVALKALGTKEGIKEPESFNAAVQETEAPYI SI ACDLI KETKXiSTEPSPDFSN 


694 


Qy 


703 


YSEMAKVEQPVPDHSELVEDSSPDSEPVDLFSDDSI PDVPQKQDETVMLVKESLTETSFE 

M M M h 1 h h 1 1 II 1 II h 1 1 1 II 1 1 II 1 II h 1 1 1 hi 1 1 h II 1 II 1 1 1 

YSEIAKFEKSVPEHAELVEDSSPESEPVDLFSDDSI PEVPQTQEEAVMLMKESLTEVS-E 


762 


Db 


695 


753 


Qy 


763 


SMIEYENKEKLSALPPEGGKPYLESFKLSLDNTKDTLLPDEVSTLSKKEKIPLQMEELST 


822 


Db 


754 


M M II 1 1 1 II 1 1 1 1 h : I MM - II M 1 1 II Mill M 

TVAQHK- EERLSAS PQELGKP YLES FQPNLHSTKDA - ASNDI PTLTKKEKI SLQMEEFNT 


811 


Qy 


823 


AVYSNDDLFISKEAQIRETETFSDSSPIEI IDEFPTLISSKTDSFSKLAREYTDLEVSHK 


882 


Db 


812 


IMIIMI III MMMMII Mill MMIM MM II 

AI YSNDDLLSSKEDKI KESETFSDSS PI EI I DEFPTFVSAKDDS - PKLAKEYTDLEVSDK 


870 


Qy 


883 


SEIANAPDGAGSLPCTELPHDLSLKNIQPKVEEKISFSDDFSKNGSATSKVLLLPPDVSA 


942 


Db 


871 


1 M M II 1 1 1 1 1 1 1 II 1 | | | | | : : : MMIM M II : 1 M 1 1 
SEIANIQSGADSLPCLELPCDLSFKNI YPK- -DEVHVSDEFSENRSSVSKASISPSNVSA 


928 


Qy 


943 


LGHTQAEI ES I VKPKVLEKEAEKKLPSDTEKEDRSPSAI FSADLGKTSWDLLYWRDI KK 

1 1 1 : MM 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 h MM II 1 II 1 II 1 1 1 1 1 1 1 

L-EPQTEMGSIVKSKSLTKEAEKKLPSDTEKEDRSLSAVLSAELSKTSWDLLYWRDIKK 


1002 


Db 


929 


987 


Qy 


1003 


TGVVFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRI YKGVIQAIQKSDEGHPFRAY 


1062 


Db 


988 


1 1 1 M II M 1 1 II II 1 1 1 1 M 1 II 1 1 II 1 1 II 1 1 1 1 M 1 1 1 1 II 1 1 1 1 M 1 II 1 1 1 1 1 II 

TGWFGASLFLLLSLTVFSI VSVTAY I ALALLSVTISFR I YKGVIQAIQKSDEGHPFRAY 


1047 


Qy 


1063 


LESEVAI SEELVQKYSNSALGHVNCTI KELRRLFLVDDLVDSLKFAVLMWVFTYVGALFN 

1 M 1 1 1 II 1 1 1 1 M 1 M 1 1 1 1 1 1 1 


1122 


Db 


1048 


LESEVAI SEELVQKYSNSALGHVNSTI KELRRLFLVDDLVDSLKFAVLMWVFTYVGALFN 


1107 


Qy 


1123 


GLTLLI LAL I SLFSVPVI YERHQAQI DHYLGLANKNVKDAMAKI QAKI PGLKRKAE 1178 

IIIMMMMMIMMIMM IMMMMMMIMMMIMMMMIh 




Db 


1108 


GLTLLI LAL I SLFS I P VI YERHQVQI DHYLGLANKS VKDAMAKI QAKI PGLKRKAD 1163 





RESULT 7 
US-09-789-386-6 

; Sequence 6, Application US/09789386 
; Patent No. US20020010324A1 
; GENERAL INFORMATION: 
; APPLICANT: MICHALOVICH, DAVID 
; APPLICANT: PRINJHA, RABINDER KUMAR 
; TITLE OF INVENTION: NOVEL COMPOUNDS 
FILE REFERENCE: GP-30165-C1 

CURRENT APPLICATION NUMBER : US/09/78 9 , 3 8 6 
; CURRENT FILING DATE: 2001-02-21 

PRIOR APPLICATION NUMBER: U.K. 9916898.1 

PRIOR FILING DATE: 1999-07-19 

PRIOR APPLICATION NUMBER: U.K. 9816024.5 

PRIOR FILING DATE: 1998-07-22 
; PRIOR APPLICATION NUMBER: US 09/359,208 



PRIOR FILING DATE: 1999-07-22 
; NUMBER OF SEQ ID NOS : 6 

SOFTWARE: FastSEQ for Windows Version 3.0 
; SEQ ID NO 6 

LENGTH: 373 
TYPE : PRT 

ORGANISM: HOMO SAPIENS 
US-09-789-386-6 

Query Match 25.2%; Score 14 95.5; DB 9; Length 373; 

Best Local Similarity 31.6%; Pred. No. 1.2e-67; 

Matches 372; Conservative 1; Mismatches 0; Indels 805; Gaps 1 
Qy 1 MEDLDQSPLVSSSDSPPRPQPAFKYQFVREPEDEEEEEEEEEEDEDEDLEELEVLERKPA 6 0 

l!lhll MIIIMIM IIIIIIIMI llllllllll M'MIIIIII.IIII 

Db 1 MEDLDQSPLVSSSDSPPRPQPAFKYQFVREPEDEEEEEEEEEEDEDEDLEELEVLERKPA 6 0 

Qy 61 AGLSAAPVPTAPAAGAPLMDFGNDFVPPAPRGPLPAAPPVAPERQPSWDPSPVSSTVPAP 120 

U I I I I I I I I I U i,l M L! I I I I I I I I 

Db 61 AGLSAAPVPTAPAAGAPLMDFGNDFVPPAPRGPLPAAPPVAPERQPSWDPSPVSSTVPAP 12 0 

Qy 121 SPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPVWTPPAPAPAAPPSTPAAPKRRG 180 

illMMIIIIMMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIMIIIIIM 

Db 121 SPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPVWTPPAPAPAAPPSTPAAPKRRG 180 

Qy 181 SSGAWXXXXKIMDLKEQPGNTISAGQEDFPSVLLETAASXPSLSPLSAASFKEHEYLGN 240 



Db 181 SSGSV 185 

Qy 241 LSTVTjPTEGTLQENVSEASKEVSEKAKTLLIDRDLTEFSELEYSEMGSSFSVSPKAESAV 300 

Db 186 185 

Qy 3 01 IVANPREEIIVKNKDEEEKLVSNNILHXQQELPTALTJCLVKEDEWSSEKAKDSFNEKRV 360 

Db 186 185 

Qy 3 61 AVEAPMREEYADFKPFERVWEVKDSKEDSDMLAAGGKIESNLESKVDKKCFADSLEQTNH 420 

Db 186 185 

Qy 421 EKDSESSNDDTSFPSTPEGIKDRSGAYITCAPFNPAATESIATNIFPLLEDPTSENXTDE 480 

Db 186 185 

Qy 4 81 KKI EEKKAQI VTEKNTSTKTSNPFFVAAQDSETDYVTTDNLTKVTEEVVANMPEGLTPDL 54 0 

Db 186 185 

Qy 541 VQEACESELNEVTGTKIAYETKMDLVQTSEVMQESLYPAAQLCPSFEESEATPSPVLPDI 600 

Db 186 185 

Qy 601 VMEAPLNSAVPSAGASVIQPSSSPLEASSVNYESIKHEPENPPPYEEAMSVSLKVSGIKE 660 



Qy 661 EIKEPENINAALQETEAPYISIACDLIKETKLSAEPAPDFSDYSEMAKVEQPVPDHSELV 720 
Db 186 185 



Qy 


721 


EDSSPDSEPVDLFSDDSI PDVPQKQDETVMLVKESLTETSFESMI EYENKEKLSALPPEG 


780 


Db 


186 










185 


Qy 


781 


GKPYLESFKLSLDNTKDTLLPDEVSTLSKKEKIPLQMEELSTAVYSNDDLFISKEAQIRE 84 0 


Db 

Qy 


186 
841 






TETFSDSSPIEI IDEFPTLISSKTDSFSKLAREYTDLEVSHKSEIANAPDGAGSLPCTEL 


185 
900 


Db 


186 








185 


Qy 


901 


PHDLSLKNIQPKVEEKISFSDDFSKNGSATSKVLLLPPDVSALGHTQAEIESIVKPKVLE 


960 


Db 


'186 










185 


Qy 


961 


KEAEKKLPSDTEKEDRSPSAI F S AD LGKTS WDLLYWRDI KKTGWFGASLFLLLSLTVF 


1020 


Db 


186 


1 lllllillM.llilllilil III 

WDLLYWRDI KKTGWFGASLFLLLSLTVF 


215 


Qy 


1021 


SIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRAYLESEVAISEELVQKYSNS 


1080 


Db 


216 


IIIIIMMIIIIIIIIIIIIIIMIIIIIIIIIIIMIIIIIIMIIIIIIIIIMIII 

SIVSVTAYIALALLSVTISFRI YKGVIQAIQKSDEGHPFRAYLESEVAISEELVQKYSNS 


275 


Qy 


1081 


ALGHVNCTI KELRRLFLVDDLVDSLKFAVLMW VFT Y VGALFNGLTLL I LAL I SLFSVPVI 


1140 


Db 


276 


M 1 1 1 1 I 1 1 1 1 M , 1 U 1 1 1 1 M 1 M'l H 

ALGHVNCTI KELRRLFLVDDLVDS LKFAVLMW VFT Y VGALFNGLTLL I LAL I SLFSVPVI 


335 


Qy 


1141 


YERHQAQ I DH YLGLANKNVKDAMAKI QAKI PGLKRKAE 1178 




Db 


336 


llllllllllllllllllllllllllllllllllllll 

YERHQAQ I DH YLGLANKNVKDAMAKI QAKI PGLKRKAE 373 





RESULT 8 
US-09-765-205-6 

; Sequence 6, Application US/09765205 
; Patent No. US200200348 00A1 
; GENERAL INFORMATION: 
; APPLICANT: Cao, Li 

; TITLE OF INVENTION: BONE MARROW SECRETED PROTEINS AND POLYNUCLEOTIDES 

FILE REFERENCE: 1458.004/200130.44 9 
; CURRENT APPLICATION NUMBER: US/ 09/765 , 2 05 
CURRENT FILING DATE: 2001-01-17 
PRIOR APPLICATION NUMBER: US/09/2 12 , 44 0 
PRIOR FILING DATE: 1998-12-16 
NUMBER OF SEQ ID NOS : 46 
; SOFTWARE: FastSEQ for Windows Version 3.0 
; SEQ ID NO 6 

LENGTH: 373 
TYPE : PRT 
ORGANISM: human 
US-09-765-205-6 



Query Match 25.2%; Score 14 95.5; DB 9; Length 373; 

Best Local Similarity 31.6%; Pred. No. 1.2e-67; 

Matches 372; Conservative 1; Mismatches 0; Indels 805; Gaps 1; 
Qy 1 MEDLDQSPLVSSSDSPPRPQPAFKYQFVREPEDEEEEEEEEEEDEDEDLEELEVLERKPA 60 

IMI Mil! I lllllll IIIMM llllll Ihll! Illlll II Mill III llllllll 

Db 1 MEDLDQSPLVSSSDSPPRPQPAFKYQFVREPEDEEEEEEEEEEDEDEDLEELEVLERKPA 60 

Qy 61 AGLSAAPVPTAPAAGAPLMDFGNDFVPPAPRGPLPAAPPVAPERQPSWDPSPVSSTVPAP 120 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 f 1 1 1 1 1 1 1 1 

Db 61 AGLSAAPVPTAPAAGAPLMDFGNDFVPPAPRGPLPAAPPVAPERQPSWDPSPVSSTVPAP 120 

Qy 121 SPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPVWTPPAPAPAAPPSTPAAPKRRG 18 0 

IIIMIMIMIIIIIIIIIIIMIIIIIIIIIIIMIIIIIIIIIIMIIIIIMIIII 

Db 121 SPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPVWTPPAPAPAAPPSTPAAPKRRG 180 

Qy 181 SSGAWXXXXKIMDLKEQPGNTISAGQEDFPSVLLETAASXPSLSPLSAASFKEHEYLGN 24 0 

MM 

Db 181 SSGSV _ 185 

QY 241 LSTVLPTEGTLQENVSEASKEVSEKAKTLLIDRDLTEFSELEYSEMGSSFSVSPKAESAV 300 
Db 186 185 

Qy 301 IVANPREEIIVKNKDEEEKLVSNNILHXQQELPTALTKLVKEDEWSSEKAKDSFNEKRV 360 
Db 186 185 

Qy 361 AVEAPMREEYADFKPFERVWEVKDSKEDSDMLAAGGKIESNLESKVDKKCFADSLEQTNH 420 
Db 186 185 

Qy 421 EKDSESSNDDTSFPSTPEGIKDRSGAYITCAPFNPAATESIATNIFPLLEDPTSENXTDE 480 
Db 186 185 

Qy 481 KKIEEKKAQIWEKNTSTKTSNPFFVAAQDSETDYVTTDNLTKVTEEWANMPEGLTPDL 540 
Db 186 _ 185 

Qy 541 VQEACESELNEVTGTKIAYETKMDLVQTSEVMQESLYPAAQLCPSFEESEATPSPVLPDI 6 00 
Db 186 185 

Qy 601 VMEAPLNSAVPSAGASVIQPSSSPLEASSVNYESIKHEPENPPPYEEAMSVSLKVSGIKE 660 
Db 186 185 

Qy 661 EIKEPENINAALQETEAPYISIACDLIKETKLSAEPAPDFSDYSEMAKVEQPVPDHSELV 720 
Db 186 185 

Qy 721 EDSSPDSEPVDLFSDDSIPDVPQKQDETVMLVKESLTETSFESMIEYENKEKLSALPPEG 78 0 
Db 186 185 

Qy 781 GKPYLESFKLSLDNTKDTLLPDEVSTLSKKEKIPLQMEELSTAVYSNDDLFISKEAQIRE 84 0 



Db 



186 



185 



Qy 


841 


TETFSDSSPIEI IDEFPTLI SSKTDSFSKLAREYTDLEVSHKSEIANAPDGAGSLPCTEL 


900 


Db 


186 




185 




Qy 


901 


PHDLSLKNIQPKVEEKISFSDDFSKNGSATSKVLLLPPDVSALGHTQAEIESI VKPKVLE 


960 


Db 


186 




185 




Qy 


961 


KEAEKKLPSDTEKEDRSPSAIFSADLGKTSWDLLYWRDIKKTGWFGASLFLLLSLTVF 


1020 


Db 


186 


Mi Mill IIIIMII III IIMIIMII 

WDLLYWRDI KKTGWFGAS LFLLLSLTVF 


215 


Qy 


1 1 


SIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRAYLESEVAISEELVEE^SNS 
i i i i i i i i t i i i i i i t i i i i i i t i i i i i i i i i i i • t i i i i i i i i i i i i i i i i i i i i i i i i 


1080 


Db 


216 


1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 II 1 II 1 1 1 1 1 1 1 1 

SI VSVTAYIALALLSVTISFRI YKGVIQAIQKSDEGHPFRAYLESEVAISEELVQKYSNS 


275 


Qy 


1081 


ALGHVNCTI KELRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGLTLLI LALI SLFSVPVI 


1140 


Db 


276 


II M 1 1 II II 1 1 II M II 1 M II II 1 ! 1 1 M 1 1 1 II II 1 1 II II 1 1 1 1 1 1 1 1 M 1 II M 1 

ALGHVNCTI KELRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGLTLLI LALI SLFSVPVI 


335 


Qy 


1141 


YERHQAQIDHYLGLANKNVKDAMAKIQAKI PGLKRKAE 1178 




Db 


336 


IIIIIIMIIIIIIIIIIMIIMIIIIIIIIIIIIII 

YERHQAQ I DH YLGLANKNVKDAMAKI QAKI PGLKRKAE 373 




RESULT 9 
US-09-893 


-348- 


-24 





; Sequence 24, Application US/09893348 
; Patent No. US20020072493A1 
; GENERAL INFORMATION: 

; APPLICANT: E I SENBACH- SCHWARTZ , Michal 
; APPLICANT: COHEN, I run R. 
; APPLICANT: BESERMAN, Pierre 
; APPLICANT: MOSONEGO, Alon 
; APPLICANT: MOALEM, G 

; TITLE OF INVENTION: ACTIVATED T-CELLS, NERVOUS SYSTEM-SPECIFIC ANTIGENS AND 
THEIR USES 

FILE REFERENCE: EIS-SCHWARTZ=2A 
; CURRENT APPLICATION NUMBER: US/09/8 93 , 348 

CURRENT FILING DATE: 2001-06-28 

PRIOR APPLICATION NUMBER: US 09/314,161 
; PRIOR FILING DATE: 1999-05-19 

PRIOR APPLICATION NUMBER: US 09/218,277 
; PRIOR FILING DATE: 1998-12-22 50XL 

PRIOR APPLICATION NUMBER: PCT/US / 98/14 715 

PRIOR FILING DATE: 1998-07-21 

PRIOR APPLICATION NUMBER: IL 124500 

PRIOR FILING DATE: 1998-05-19 
; NUMBER OF SEQ ID NOS : 2 9 

SOFTWARE: Patent In version 3.1 
; SEQ ID NO 24 
LENGTH: 373 
TYPE : PRT 

ORGANISM: Homo sapiens 



US-09-893-348-24 

Query Match 25.2%; Score 1495.5; DB 9; Length 373; 

Best Local Similarity 31.6%; Pred. No. 1.2e-67; 

Matches 372; Conservative 1; Mismatches 0; Indels 8 05; Gaps 1 
Qy ' 1 MEDLDQSPLVSSSDSPPRPQPAFKYQFVREPEDEEEEEEEEEEDEDEDLEELEVLERKPA 60 

IIIIIIMIIMMMIIMIMIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIM 

Db 1 MEDLDQSPLVSSSDSPPRPQPAFKYQFWEPEDEEEEEEEEEEDEDEDLEELEVLERKPA 60 

Qy 61 AGLSAAPVPTAPAAGAPLMDFGNDFVPPAPRGPLPAAPPVAPERQPSWDPSPVSSTVPAP 12 0 

1 . 1 1 1 II MMII M 1 1 II 1 1 1 1 1 1 1 1 1 1 1 Ml 1 1 1 IIMI I II M I M 1 1 1 1 1 1 1 1 i , 

Db 61 AGLSAAPVPTAPAAGAPLMDFGNDFVPPAPRGPLPAAPPVAPERQPSWDPSPVSSTVPAP 120 

Qy 121 SPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPVWTPPAPAPAAPPSTPAAPKRRG 18 0 

: M I I I I I I I ;:l ! I I I I i | | | |i| | |,| M i 

Db 121 SPLSAAA¥0B£K5Xe EPPARPPPPPPASVSPQAEPVWTPPAPAPAAPPSTPAAPKRRG 18 0 

Qy 181 SSGAWXXXXKIMDLKEQPGNTISAGQEDFPSVLLETAASXPSLSPLSAASFKEHEYLGN 240 

Ilhl 

Db 181 SSGSV . 

Qy 241 LSTVLPTEGTLQENVSEASKEVSEKAKTLLIDRDLTEFSELEYSEMGSSFSVSPKAESAV 300 
Db 186 185 

Qy 301 IVANPREEIIVKNKDEEEKLVSNNILHXQQELPTALTKLVKEDEWSSEKAKDSFNEKRV 360 
Db 186 135 

Qy 361 AVEAPMREEYADFKPFERVWEVKDSKEDSDMLAAGGKIESNLESKVDKKCFADSLEQTNH 420 
Db 186 185 

Qy 421 EKDSESSNDDTSFPSTPEGIKDRSGAYITCAPFNPAATESIATNIFPLLEDPTSENXTDE 480 
Db 186 185 

Qy 481 KKIEEKKAQIVTEKNTSTKTSNPFFVAAQDSETDYVTTDNLTKVTEEWANMPEGLTPDL 54 0 
Db 186 185 

Qy 541 VQEACESELNEVTGTKIAYETKMDLVQTSEVMQESLYPAAQLCPSFEESEATPSPVLPDI 60 0 
Db 186 185 

Qy 601 VMEAPLNSAVPSAGASVI QPSSS PLEASS VNYES I KHEPENPPPYEEAMS VSLKVSGI KE 660 
Db 186 185 

Qy 661 EIKEPENINAALQETEAPYISIACDLIKETKLSAEPAPDFSDYSEMAKVEQPVPDHSELV 720 

Db 186 185 

Qy 721 EDSSPDSEPVDLFSDDS I PDVPQKQDETVMLVKESLTETSFESMI EYENKEKLSALPPEG 780 



Qy 781 GKPYLESFKLSLDNTKDTLLPDEVSTLSKKEKI PLQMEELSTAVYSNDDLFI SKEAQIRE 84 0 

Db 186 185 

Qy 841 TETFSDSSPIEI IDEFPTLISSKTDSFSKLAREYTDLEVSHKSEIANAPDGAGSLPCTEL 900 

Db 186 185 

Qy 901 PHDLSLKNI QPKVEEKI SFSDDFSKNGSATSKVLLLPPDVSALGHTQAEI ES I VKPKVLE 960 

Db 186 185 

Qy 961 KEAEKKLPSDTEKEDRSPSAI FSADLGKTSWDLLYWRD I KKTGWFGASLFLLLSLTVF 1020 

I! I I I I I I I I I I I II I I I il Ml II 

Db 186 WDLLYWRDI KKTGWFGASLFLLLSLTVF 215 

Qy 1021 SIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRAYLESEVAISEELVQKYSNS 1080 

III MINIM I MINIM Ml Illllllillllll 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

Db 216 SIVSVTAYIALALLSVTISFRI YKGVIQAIQKSDEGHPFRAYLESEVAISEELVQKYSNS 275 

Qy 1081 ALGHWCTI KELRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGLTLLI LALI SLFSVPVI 114 0 

I I I N N I I I I I I I I II II I M I II II II I II I I II I II I I I I I I II I II II I II II I I I 
Db 276 ALGHWCTI KELRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGLTLLILALI SLFSVPVI 335 

Qy 1141 YERHQAQ I DHYLGLANKNVKDAMAKIQAKI PGLKRKAE 1178 

1 1 1! 1 1 1 1 1 : ii Ih I h 1 1 :i 1 1 1 1 1 1 1 imm 

Db 336 YERHQAQ I DH YLGLANKNVKDAMAKI QAKI PGLKRKAE 373 

RESULT 10 
US-10-060-036-72 

Sequence 72, Application US/10060036 
Publication No. US20030073144A1 
GENERAL INFORMATION: 
APPLICANT : Benson, Darin R. 
APPLICANT: Kalos, Michael D. 

AP : Lodes, Michael J. LERKP4 5 

APPLICANT: Persing, David H. 
APPLICANT: Hepler, William T. 
APPLICANT: Jiang, Yuqiu 

TITLE OF INVENTION: COMPOSITIONS AND METHODS FOR THE THERAPY 
TITLE OF INVENTION: AND DIAGNOSIS OF PANCREATIC CANCER 
FILE REFERENCE: 210121.566 

CURRENT APPLICATION NUMBER: US/ 10/060 , 036 
CURRENT FILING DATE: 2002-01-30 
NUMBER OF SEQ ID NOS : 4560 
SOFTWARE: FastSEQ for Windows Version 4.0 
SEQ ID NO 72 
LENGTH: 373 
TYPE : PRT 

ORGANISM: Homo sapiens 
US-10-060-036-72 



Query Match 25 . 2%; 

Best Local Similarity 31.6%; 
Matches 372; Conservative 



Score 14 95.5; DB 15; Length 373; 
Pred. No. 1.2e-67; 
1; Mismatches 0; Indels 805; Gaps 1; 



Qy 1 MEDLDQSPLVSSSDSPPRPQPAFKYQFVREPEDEEEEEEEEEEDEDEDLEELEVLERKPA 60 

IIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIMIIIIMMIIIIMIIIIMIIIIII 

Db 1 MEDLDQSPLVSSSDSPPRPQPAFKYQFWEPEDEEEEEEEEEEDEDEDLEELEVLERKPA 60 

Qy 61 AGLSAAPVPTAPAAGAPLMDFGNDFVPPAPRGPLPAAPPVAPERQPSWDPSPVSSTVPAP 12 0 

MINIMI III! II I IMIIIIIL II MINI Ml II MINIM MINIMUM 

Db 61 AGLSAAPVPTAPAAGAPLMDFGNDFVPPAPRGPLPAAPPVAPERQPSWDPSPVSSTVPAP 12 0 

Qy 121 SPLSAAAVSPSKXjPEDDEPPARPPPPPPASVSPQAEPWTPPAPAPAAPPSTPAAPKRRG 180 

lllllll II I Mill MINIMMMIMIMM III Mill NNIIIIIMIMN 

Db 121 SPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPVWTPPAPAPAAPPSTPAAPKRRG 180 

Qy 181 SSGAWXXXXKIMDLKEQPGNTISAGQEDFPSVLLETAASXPSLSPLSAASFKEHEYLGN 24 0 



Db 181 SSGSV--- 18950XA 

Qy 241 LSTVLPTEGTLQENVSEASKEVSEKAKTLLIDRDLTEFSELEYSEMGSSFSVSPKAESAV 300 

Db 186 185 

Qy 301 IVANPREEIIVKNKDEEEKLVSNNILHXQQELPTALTKLVKEDEWSSEKAKDSFNEKRV 360 

Db 186 185 

Qy 361 AVEAPMREEYADFKPFERVWEVKDSKEDSDMLAAGGKIESNLESKVDKKCFADSLEQTNH 420 

Db 186 ■ ; 185 

Qy 421 EKDSESSNDDTSFPSTPEGIKDRSGAYITCAPFNPAATESIATNIFPLLEDPTSENXTDE 480 

I 

Db 186 185 

Qy 481 KKIEEKKAQIVTEKNTSTKTSNPFFVAAQDSETDYVTTDNLTKVTEEWANMPEGLTPDL 540 

Db 186 185 

Qy 541 VQEACESELNEVTGTKIAYETKMDLVQTSEVMQESLYPAAQLCPSFEESEATPSPVLPDI 600 

Db 186 185 

Qy 601 VMEAPLNSAVP SAGAS VI QPSSS PLEASSVNYES I KHEPENPPPYEEAMS VSLKVSGI KE 660 

Db 186 185 

Qy 661 EI KEPENINAALQETEAPYI S IACDLI KETKLSAEPAPDFSDYSEMAKVEQPVPDHSELV 72 0 

Db 186 185 

Qy 721 EDSSPDSEPVDLFSDDSI PDVPQKQDETVMLVKESLTETSFESMI EYENKEKLSALPPEG 78 0 

Db 186 185 

Qy 781 GKPYLESFKLSLDNTKDTLLPDEVSTLSKKEKI PLQMEELSTAVYSNDDLFI SKEAQI RE 84 0 

Db 186 185 



Qy 841 TETFSDSSPIEIIDEFPTLISSKTDSFSKLAREYTDLEVSHKSEIANAPDGAGSLPCTEL 900 

Db 186 185 

Qy 901 PHDLSLKNIQPKVEEKISFSDDFSKNGSATSKVLLLPPDVSALGHTQAEIESI VKPKVLE 960 

Db 186 185 

Qy 961 KEAEKKLPSDTEKEDRSPSAIFSADLGKTSWDLLYWRDIKKTGWFGASLFLLLSLTVF 1020 

IMIIIIIIIIIIIIIIIIIIIIMMMI 

Db 186 WDLLYWRDI KKTGVVFGASLFLLLSLTVF 215 

Qy 1021 SIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRAYLESEVAISEELVQKYSNS 1080 

MIMMII MIMMIM IMMMIM: MMMMMMMIMM II MM 

Db 216 S I VS VTAYI ALALLSVTI SFRI YKGVI QAI QKSDEGHPFRAYLESEVAI SEELVQKYSNS 275 

Qy 1081 ALGHVNCTI KELRRLFLVDDLVT)SLKFAVLMWVFTYVGALFNGLTLLILALI SLFSVPVI 1140 

11 M M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 i 1 1 1 1 i 1 1 1 1 

Db 276 ALGHVNCTI KELRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGLTLLI LALI SLFSVPVI 33 5 

Qy 1141 YERHQAQ I DHYLGLANKNVKDAMAKI QAKI PGLKRKAE 1178 

1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 M 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

Db 336 YERHQAQ I DHYLGLANKNVKDAMAKI QAKI PGLKRKAE 373 



RESULT 11 
US-09-789-386-4 

Sequence 4, Application US/09789386 
Patent No. US20020010324A1 
GENERAL INFORMATION: 
APPLICANT: MICHALOVICH, DAVID 
APPLICANT: PRINJHA, RABINDER KUMAR 
TITLE OF INVENTION: NOVEL COMPOUNDS 
FILE REFERENCE: GP-30165-C1 

CURRENT APPLICATION NUMBER: US/09/789,386 
CURRENT FILING DATE: 2001-02-21 
PRIOR APPLICATION NUMBER: U.K. 9916898.1 
PRIOR FILING DATE: 1999-07-19 
PRIOR APPLICATION NUMBER: U.K. 9816024.5 
PRIOR FILING DATE: 1998-07-22 
PRIOR APPLICATION NUMBER: US 09/359,208 
PRIOR FILING DATE: 1999-07-22 
NUMBER OF SEQ ID NOS : 6 

SOFTWARE: FastSEQ for Windows Version 3.0 
SEQ ID NO 4 
LENGTH : 2 8 9 
TYPE: PRT 

ORGANISM: HOMO SAPIENS 
FEATURE : 

NAME/KEY: UNSURE 
LOCATION: (31) (138) 
US-09-789-386-4 



Query Match 23.9%; Score 1417; DB 9; Length 28 9; 

Best Local Similarity 99.3%; Pred. No. 7.6e-64; 

Matches 285; Conservative 0; Mismatches 2; Indels 0; Gaps 0; 



Qy 


193 


Db 


3 


Qy 


253 


Db 


63 


Qy 


313 


Db 


123 


Qy 


373 


Db 


183 


Qy 


433 


Db 


243 



I I I I U 'IU I U I I I I ! I I'M II I I 

MDLKEQPGNTISAGQEDFPSVLLETAASXPSLSPLSAASFKEHEYLGNLSTVLPTEGTLQ 
ENVS EASKE VSEKAKTLLI DRDLTEFSELE YS EMGSS FSVS PKAESAVI VANPREE 1 1 VK 

M; 1 1 1 II I II I Mi 1 1 I il 1 1 1 1 1, 1 1 1 li I! I II !■ II I II M I I I I I 

ENVS EAS KE VSEKAKTLLI DRDLTEFSELE YS EMGSS FSVS PKAESAVI VANPREE 1 1 VK 
NKDEEEKLVSNNILHXQQELPTALTKLVKEDEWSSEKAKDSFNEKRVAVEAPMREEYAD 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

NKDEEEKLVSNNILHXQQELPTALTKLVKEDEWSSEKAKDSFNEKRVAVEAPMREEYAD 
FKPFERVWEVKDSKEDSDMLAAGGKIESNLESKVDKKCFADSLEQTNHEKDSESSNDDTS 

I MIMMMMM MMMMIIMM MMIIIIIMIM MIMMIMM 

FKPFERVWEVKDSKEDSDMLAAGGKIESNLESKVDKKCFADSLEQTNHEKDSESSNDDTS 
FPSTPEGI KDRSGAYI TCAPFNPAATES I ATNI FPLLEDPTSENXTD 4 7 9 

M iMMMIMM MIMMMMM IM MINI II 



RESULT 12 
US-10-205-194-164 

Sequence 164, Application US/10205194 
Publication No. US200301343 01A1 
GENERAL INFORMATION: 
APPLICANT: Warner-Lambert Company 
APPLICANT: Lee, Kevin 
APPLICANT: Dixon, Alistair 
APPLI CANT : Brooksbank , Robert 
APPLICANT: Pinnock, Robert 

TITLE OF INVENTION: Identification and Use of Molecules Implicated in Pain 
FILE REFERENCE: WL-A-018201 

CURRENT APPLICATION NUMBER: US/10/205,194 
CURRENT FILING DATE: 5200-07-24 
PRIOR APPLICATION NUMBER: GB 0118354.0 
PRIOR FILING DATE: 2001-07-27 
NUMBER OF SEQ ID NOS : 177 
SOFTWARE: Patentln Ver. 2.1 
SEQ ID NO 164 
LENGTH: 379 
TYPE: PRT 

ORGANISM: Rattus norvegicus 
FEATURE : 

OTHER INFORMATION: Foocen-m2 reticulon 
US-10-205-194-164 

Query Match , 20.7%; Score 1225.5; DB 12; Length 379; 

Best Local Similarity 28.6%; Pred. No. 4.6e-54; 

Matches 338; Conservative 17; Mismatches 22; Indels 803; Gaps 10; 

Qy 1 MEDLDQSPLVSSS-DSPPRPQPAFKYQFVREPEDEE-EEEEEEEEDEDEDLEELEVLERK 58 

Ilhlll Mill llllll llllllll llllll IIIIM|::|| llllllll III 

Db 1 MEDIDQSSLVSSSTDSPPRPPPAFKYQFVTEPEDEEDEEEEEDEEEDDEDLEELEVLERK 60 

Qy 5 9 PAAGLSAAPVPTAPAAGAPLMDFGNDFVPPAPRGPLPAAPPVAPERQPSWDPSPVSSTVP 118 

llllllll II III IN :| I I llllllllllll llllllll: || : I 



Db 61 PAAGLSAAAVP - - PAAAAPLLDFSSDSVPPAPRGPLPAAPPAAPERQPSWERSPAA P 115 

Qy 119 APSPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPVWTPPAPAPAAPPSTPAAPKR 178 

III 1 1 1 1 IIIIIIIIIIIIMIIIIII II II lllllllllllll 

Db 116 APSLPPAAAVLPSKLPEDDEPPARPPPPPPAGASPLAE PAAPPSTPAAPKR 166 

Qy 17 9 RGSSGAWXXXXKI MDLKEQPGNTI SAGQEDFPS VLLETAASXPSLS PLSAASFKEHEYL 238 
III 

Db 167 RGS 169 

Qy 23 9 GNLSTVLPTEGTLQENVSEASKEVSEKAKTLLIDRDLTEFSELEYSEMGSSFSVSPKAES 2 98 

Db 170 169 

Qy 299 AVIVANPREEIIVKNKDEEEKLVSNNILHXQQELPTALTKLVKEDEWSSEKAKDSFNEK 358 

Db 170 169 

Qy 359 RVAVEAPMREEYADFKPFERVWEVKDSKEDSDMLAAGGKIESNLESKVDKKCFADSLEQT 418 

Db 170 169 

Qy 419 NHEKDSESSNDDTSFPSTPEGIKDRSGAYITCAPFNPAATESIATNIFPLLEDPTSENXT 478 

I hi I 

Db 17 0 GSVDETLF 177 

Qy 47 9 DEKKIEEKKAQIVTEKNTSTKTSNPFFVAAQDSETDYVTTDNLTKVTEEWANMPEGLTP 538 

Db 178 177 

Qy 539 DLVQEACESELNEVTGTKIAYETKMDLVQTSEVMQESLYPAAQLCPSFEESEATPSPVLP 5 98 

Db 178 177 

Qy 599 DIVMEAPLNSAVPSAGASVIQPSSSPLEASSVNYESIKHEPENPPPYEEAMSVSLKVSGI 658 

Db 178 ALPAA 182 

Qy 659 KEEIKEPENINAALQETEAPYISIACDLIKETKLSAEPAPDFSDYSEMAKVEQPVPDHSE 718 

Db 183 182 

Qy 719 LVEDSSPDSEPVDLFSDDSI PDVPQKQDETVMLVKESLTETSFESMIEYENKEKLSALPP 778 

I I I I 

Db 183 SEPV 1 186 

Qy 779 EGGKPYLESFKLSLDNTKDTLLPDEVSTLSKKEKI PLQMEELSTAVYSNDDLFI SKEAQI 838 

Db 187 186 

Qy 839 RETETFSDSSPIEI IDEFPTLISSKTDSFSKLAREYTDLEVSHKSEIANAPDGAGSLPCT 898 

Db 187 186 

Qy 899 ELPHDLSLKNIQPKVEEKISFSDDFSKNGSATSKVLLLPPDVSALGHTQAEIESI VKPKV 958 

Db 187 1 86 



Qy 


959 


Db 


187 


Qy 


1019 


Db 


220 


Qy 


1079 


Db 


280 


Qy 


1139 


Db 


340 



-IPS SAWDLLiYWRD I KKTGWFGASLFLLLSLT 219 



IMIIMIIIIMIIIIIMIIIMIIIIIIIIIIIMIMIIIIIIIIIIIIIIIIII 

TS I VSVTAYI ALALLSVTI SFRI YKGVI QAI QKSDEGHPFRAYLESEVAI SEELVQKYS 
fSALGHVNCTI KELRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGLTLLI LALI SLFSVP 

MUM M I M M M M MM I M M M MM M M M M MMM M MMM M 

[SALGHVNSTI KELRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGLTLLI LALI SLFS I P 
r I YERHQAQ I DHYLGLANKN VKDAMAKI QAKI PGLKRKAE 1178 

llllll M M M M MMM MM M MM M M M: 



RESULT 13 
US-09-893-348-20 

Sequence 20, Application US/09893348 
Patent No. US20020072493A1 
GENERAL INFORMATION: 
APPLICANT: E I SENBACH- SCHWARTZ , Michal 
APPLICANT: COHEN, I run R. 
APPLICANT: BESERMAN, Pierre 
APPLICANT: MOSONEGO, Alon 
APPLICANT: MOALEM , Gila 

TITLE OF INVENTION: ACTIVATED T-CELLS, NERVOUS SYSTEM-SPECIFIC ANTIGENS AND 
THEIR USES 

FILE REFERENCE: EIS -SCHWARTZ=2A 
CURRENT APPLICATION NUMBER: US/ 09/8 93 , 34 8 
CURRENT FILING DATE: 2001-06-28 
PRIOR APPLICATION NUMBER: US 09/314,161 
PRIOR FILING DATE: 1999-05-19 
PRIOR APPLICATION NUMBER: US 09/218,277 
PRIOR FILING DATE: 1998-12-22 
PRIOR APPLICATION NUMBER: PCT/US98/14715 
PRIOR FILING DATE: 1998-07-21 
PRIOR APPLICATION NUMBER: IL 124500 
PRIOR FILING DATE: 1998-05-19 
NUMBER OF SEQ ID NOS : 29 
SOFTWARE: Patent In version 3.1 
SEQ ID NO 20 
LENGTH: 360 
TYPE : PRT 

ORGANISM: Rattus norvegicus 
US-09-893-348-20 

Query Match 20.0%; Score 1187; DB 9; Length 360; 

Best Local Similarity 27.7%; Pred. No. 3.7e-52; 

Matches 327; Conservative 12; Mismatches 19; Indels 822; Gaps 7; 
Qy 1 MEDLDQSPLVSSS-DSPPRPQPAFKYQFVREPEDEE-EEEEEEEEDEDEDLEELEVLERK 58 

MMM Hill llllll llllllll llllll 1 1 i 1 1 = 1 1 ^ 1 1 1 1 1 1 1 1 1 1 1 1 1 

Db 1 MEDIDQSSLVSSSTDSPPRPPPAFKYQFVTEPEDEEDEEEEEDEEEDDEDLEELEVLERK 60 



Qy 59 PAAGLSAAPVPTAPAAGAPLMDFGNDFVPPAPRGPLPAAPPVAPERQPSWDPSPVSSTVP 118 

MMIMI II III Mhll H IMIIIIIIIIIII MINIM: || : | 
Db 61 PAAGLSAAAVP- -PAAAAPLLDFSSDSVPPAPRGPLPAAPPAAPERQPSWERSPAA P 115 

Qy 119 APSPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPVWTPPAPAPAAPPSTPAAPKR 178 

III 1 1 1 1 (MINIM MINIUM I II II lllllllllllll 

Db 116 APSLPPAAAVLPSKLPEDDEPPARPPPPPPAGASPLAE PAAPPSTPAAPKR 166 

Qy 179 RGSSGAWXXXXKIMDLKEQPGNTISAGQEDFPSVLLETAASXPSLSPLSAASFKEHEYL 238 
III 

Db 167 RGS 169 

Qy 23 9 GNLSTVLPTEGTLQENVS EAS KEVS EKAKTLLI DRDLTEFS ELE YS EMGSSFS VS PKAES 2 98 

Db 1700 169 

Qy 299 AVIVANPREEIIVKNKDEEEKLVSNNILHXQQELPTALTKLVKEDEWSSEKAKDSFNEK 358 

Db 170 169 

Qy 359 RVAVEAPMREEYADFKPFERVWEVKDSKEDSDMLAAGGKIESNLESKVDKKCFADSLEQT 418 

Db 170 169 

Qy 419 NHEKDSESSNDDTSFPSTPEGIKDRSGAYITCAPFNPAATESIATNIFPLLEDPTSENXT 478 

Db 170 169 

Qy 479 DEKKIEEKKAQI VTEKNTSTKTSNPFFVAAQDSETDYVTTDNLTKVTEEVVANMPEGLTP 538 

Db 170 169 

Qy 53 9 DLVQEACESELNEVTGTKIAYETKMDLVQTSEVMQESLYPAAQLCPSFEESEATPSPVLP 598 

Db 170 169 

Qy 599 DIVMEAPLNSAVPSAGASVIQPSSSPLEASSVNYESIKHEPENPPPYEEAMSVSLKVSGI 658 

Db 170 169 

Qy 659 KEEIKEPENINAALQETEAPYISIACDLIKETKLSAEPAPDFSDYSEMAKVEQPVPDHSE 718 

Db 170 169 

Qy 719 LVEDSS PDSEPVDLFSDDS I PDVPQKQDETVMLVKESLTETSFESMI EYENKEKLSALPP 778 

SAELSKTL 

Db 170 169 

Qy 779 EGGKPYLESFKLSLDNTKDTLLPDEVSTLSKKEKI PLQMEELSTAVYSNDDLFI SKEAQI 838 

Db 170 169 

Qy 839 RETETFSDSSPIEI IDEFPTLISSKTDSFSKLAREYTDLEVSHKSEIANAPDGAGSLPCT 898 

lh 

Db 170 GSV 172 

Qy 8 99 ELPHDLSLKNIQPKVEEKISFSDDFSKNGSATSKVLLLPPDVSALGHTQAEIESIVKPKV 958 



Db 



173 



172 



Qy 


959 


LEKEAEKKLPSDTEKEDRSPSAIFSADLGKTSWDLLYWRD I KKTGWFGASLFLLLSLT 


1018 


Db 


173 


II 1 II ! 1 1 1 1 ! II 1 1 1 III II 1 1 1 
WDLLYWRDI KKTGWFGASLFLLLSLT 


200 


Qy 


1019 


VFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRAYLESEVAISEELVQKYS 


1078 


Db 


201 


IIIIIIIIIIMIIIIIIIIIIMIIIIIMIIIIIIIMMIIIIIIIMIIIIIIIII 

VFSI VSVTAYIALALLSVTISFRI YKGVIQAIQKSDEGHPFRAYLESEVAISEELVQKYS 


260 


Qy 


1079 


NSALGHVNCTI KELRRLFLVDDLVDSLKFAVLMW VFT YVGALFNGLTLL I LAL I SLFSVP 

Illlllll MINN IMIIIIIIIIIII MINIM IIIMMIIII IIMIIII M 

NSALGHVNSTI KELRRLFLVDDLVDSLKFAVLMW VFTYVGALFNGLTLL I LAL I SLFSI P 


1138 


Db 


261 


320 


Qy WO2 0 


ID1393Y 1 . 5X12DHXLGLANKNVKDAMAKIQAKIPGLKRKAE 1178 
i 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 . II 1 1 1 1 i 1 1 1 1 1 1 1 I E I I I . 




Db 


321 


1 l 1 1 - 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 • 

VI YERHQVQ I DHYLGLANKSVKDAMAKI QAKI PGLKRKAD 360 




RESULT 14 
US-09-893 


-348- 


-25 





Sequence 25, Application US/09893348 
Patent No. US20020072493A1 
GENERAL INFORMATION: 
APPLICANT: EI SENBACH- SCHWARTZ , Michal 
APPLICANT: COHEN, I run R. 
APPLICANT: BESERMAN, Pierre 
APPLICANT: MOSONEGO, Alon 
APPLICANT: MOALEM , Gila 

TITLE OF INVENTION: ACTIVATED T-CELLS, NERVOUS SYSTEM-SPECIFIC ANTIGENS AND 
THEIR USES 

FILE REFERENCE: EIS-SCHWARTZ-2A 
CURRENT APPLICATION NUMBER: US/09/893 , 348 
CURRENT FILING DATE: 2001-06-28 
PRIOR APPLICATION NUMBER: US 09/314,161 
PRIOR FILING DATE: 1999-05-19 
PRIOR APPLICATION NUMBER: US 09/218,277 
PRIOR FILING DATE: 1998-12-22 
PRIOR APPLICATION NUMBER: PCT/US98/14715 
PRIOR FILING DATE: 1998-07-21 

Pn LICATION NUMBER: IL 124500 otide30 d 

PRIOR FILING DATE: 1998-05-19 
NUMBER OF SEQ ID NOS : 2 9 
SOFTWARE: Patentln version 3.1 
SEQ ID NO 25 
LENGTH: 199 
TYPE : PRT 

ORGANISM: Homo sapiens 
US-09-893-348-25 

Query Match 15.7%; Score 931; DB 9; Length 199; 

Best Local Similarity 99.0%; Pred. No. 1.3e-39; 

Matches 189; Conservative 0; Mismatches 2; Indels 0; Gaps 0; 
Qy 988 KTSWDLLYWRDIKKTGWFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVI 1047 

I h i 1 , M ! l 1 1 1 1 1 1 1 1 Ml III II 1 1 1 1 II 1 1 1 II 1 1 1 1 1 1 III II 1 1 1 



Db 



9 KDKWDLLYWRDI KKTGWFGASLFLLLSLTVFS I VSVTAYI ALALLS VTI SFRI YKGVI 68 



Qy 1048 QAI QKSDEGHPFRAYLESEVAI SEELVQKYSNSALGHVNCT I KELRRLFLVDDLVDSLKF 1107 

I M 1 1 1 II I M I M I M 1 1 1 II 1 1 II II I M 1 1 1 1 1 II II 1 1 1 1 1 1 1 1 1 1 1 

Db 69 QAI QKSDEGHPFRAYLESEVAI SEELVQKYSNSALGHVNCTI KELRRLFLVDDLVDSLKF 128 

Qy 1108 AVLMWVFTYVGALFNGLTLLI LALI SLFSVPVI YERHQAQI DHYLGLANKNVKDAMAKI Q 1167 

I I I I I I I I ! I I I I I ! I I I I I I I I 1 1 I I I 1 1 I I I I I I I I I I I I I I I I I : 1 1 I I I 

Db 12 9 AVLMWVFTYVGALFNGLTLLI LALI SLFSVPVI YERHQAQI DHYLGLANKNVKDAMAKI Q 188 

Qy 1168 AKI PGLKRKAE 1178 

lllllllllll 
Db 18 9 AKI PGLKRKAE 199 



RESULT 15 
US-09-893-348-21 

Sequence 21, Application US/09893348 
Patent No. US200200724 93A1 
GENERAL INFORMATION: 
APPLICANT: EI SENBACH- SCHWARTZ , Michal 
APPLICANT: COHEN , I run R. 
APPLICANT: BESERMAN, Pierre 
APPLICANT: MOSONEGO, Alon 
APPLICANT: MOALEM, Gila 

TITLE OF INVENTION: ACTIVATED T- CELLS , NERVOUS SYSTEM- SPECIFIC ANTIGENS AND 
THEIR USES 

FILE REFERENCE: EIS-SCHWARTZ=2A 
CURRENT APPLICATION NUMBER: US/09/8 93 , 348 
CURRENT FILING DATE: 2001-06-28 
PRIOR APPLICATION NUMBER: US 09/314,161 
PRIOR FILING DATE: 1999-05-19 
PRIOR APPLICATION NUMBER: US 09/218,277 
PRIOR FILING DATE: 1998-12-22 
PRIOR APPLICATION NUMBER: PCT/US98 / 14 715 
PRIOR FILING DATE: 1998-07-21 
PRIOR APPLICATION NUMBER: IL 124500 
PRIOR FILING DATE: 1998-05-19 
NUMBER OF SEQ ID NOS : 2 9 
SOFTWARE: Patent In version 3.1 
SEQ ID NO 21 
LENGTH: 19 9 
TYPE : PRT 

ORGANISM: Rattus norvegicus 
US-09-893-348-21 

Query Match 15.3%; Score 908; DB 9; Length 199; 

Best Local Similarity 96.3%; Pred. No. 1.9e-38; 

Matches 184; Cons££ya6ive 3; Mismat 4; Indels 0; Gaps 0; 

Qy 988 KTS WDLLYWRDI KKTGWFGASLFLLLSLTVFS I VSVTAYI ALALLS VTI SFRI YKGVI 1047 

I IIIIMIIIIIIMMIIIMIIMIIIIIIMIMIIMIIMIIMIIIIIIIII 

Db 9 KDKWDLLYWRDI KKTGWFGASLFLLLSLTVFS I VSVTAYI ALALLSVTI SFRI YKGVI 68 

Qy 1048 QAI QKSDEGHPFRAYLESEVAI SEELVQKYSNSALGHVNCTI KELRRLFLVDDLVDSLKF 1107 

I I I M I I I I I I I I I I I I I ! I I I I I II M I I I I I ' I Ml M I II I II I I I i, :, I I 

Db 69 QAI QKSDEGHPFRAYLESEVAISEELVQKYSNSALGHVNSTI KELRRLFLVDDLVDSLKF 128 



Qy 1108 AVLMWVFTYVGALFNGLTLLI LALI SLFSVPVI YERHQAQI DHYLGLANKNVKDAMAKI Q 1167 

M 1 1 1 II 1 1 1 1 1 1 1 II 1 1 M' 1 1 1 IN MM ,1 1 1 1 1 1 1 II 1 1 1 h I II 1 1 Ml 

Db 12 9 AVLMWVFTYVGALFNGLTLLI LALI SLFS I PVI YERHQVQ I DHYLGLANKS VKDAMAKI Q 18 8 



Qy 1168 AKI PGLKRKAE 1178 

lilllllllh 
Db 18 9 AKI PGLKRKAD 199 



Search completed: January 22, 2004, 16:44:57 
Job time : 63.0516 sees 



GenCore version 5.1.6 
Copyright (c) 1993 - 2004 Compugen Ltd. 



OM protein - protein search, using sw model 



Run on : 



January 22, 2004, 16:31:15 ; Search time 19.6482 Seconds 

(without alignments) 
2819.465 Million cell updates/sec 



Title: 

Perfect score : 
Sequence : 



US-09-830-972-29 
5923 

1 MEDLDQSPLVSSSDSPPRPQ VKDAMAKI QAKI PGLKRKAE 1178 



Scoring table: 



BLOSUM62 

Gapop 10.0 , Gapext 0.5 



Searched: 127863 seqs, 47026705 residues 

Total number of hits satisfying chosen parameters: 127863 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 



Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 



Database : SwissProt_41 : * 

Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



SUMMARIES 

o 

Result Query 

No. Score Match Length DB ID Description 



1 


5815 


98 


.2 


1192 


1 


RTN4_HUMAN 


Q9nqc3 


homo sapien 


2 


4296.5 


72 


.5 


1163 


1 


RTN4 RAT 


Q9jkll 


rattus norv 


3 


915 


15 


.4 


199 


1 


RTN4_MOUSE 


Q99p72 


mus musculu 


4 


791 


13 


.4 


777 


1 


RTN1_RAT 


Q64548 


rattus norv 


5 


787.5 


13 


.3 


776 


1 


RTN1_HUMAN 


Q16799 


homo sapien 


6 


626 


10 


.6 


237 


1 


RTNplHOUSffi52 


Q9es97 


mus musculu 


7 


625.5 


10 


. 6 


236 


1 


RTN3_HUMAN 


095197 


homo sapien 


8 


514 


8 


.7 


545 


1 


RTN2_HUMAN 


075298 


homo sapien 


9 


500 


8, 


.4 


471 


1 


RTN2JVIOUSE 


070622 


mus musculu 


10 


324 . 5 


5, 


.5 


865 


1 


CPN_DROME 


Q02910 


drosophila 


11 


320 


5. 


. 4 


5147 


1 


PCLO__HUMAN 


Q9y6v0 


homo sapien 


12 


315 


5 . 


.3 


5038 


1 


PCL0_M0USE 


Q9qyx7 


mus musculu 


13 


299 


5. 


, 0 


3924 


1 


AN K2 _HUMAN 


Q01484 


homo sapien 


14 


296. 5 


5. 


, 0 


2468 


1 


MAPB_HUMAN 


P46821 


homo sapien 


15 


289 


4 . 


. 9 


2459 


1 


MAPB_RAT 


P15205 


rattus norv 


16 


289 


4 . 


9 


2464 


1 


MAPB_MOUSE 


P14873 


mus musculu 


17 


287 


4 . 


8 


5120 


1 


PCLO_CHICK 


Q9pu36 


gallus gall 



1 R 

-L O 


9 7£ 
z / o 


A 
4 


. 1 


3 U O 3 


X 


PLXjU KA1 


Q9jks6 


rattus norv 




9 64 ^ 

Z O *± . 3 


A 
*± 


. 3 


034 3 


X 




Q9n4m4 


caenorhabdi 


2 0 




A 
*± 


A 
. rt 




X 


or 1 OT, 7 TJT TTV/T A M 

rbLV HUMAN 


P13611 


homo sap i en 


2 1 


969 ^ 


A 
1 


A 


OOJZ 


X 


T TNT O Q OA TT'TTT 

UNyy LAhhL 


001761 


caenorhabdi 


z z 


9 £ n r 

Z O U . 3 


A 

4 


- 4 


O O O T 

33 81 


1 


O /""I /"IT T T— 1 ^"\T TT TkT 

PGCV BOVIN 


P81282 


bos taurus 


9 1 
z J 


zoy . o 


4 


. 4 


1828 


1 


MAP2 MOUSE 


P20357 


mus musculu 


94 
z *± 


91:0 cr 

Z 3 0 . 3 


A 

4 


A 

. 4 


0 q n c 
zo Ud 


X 


Jiff 7\ t-\ t\ ttt TH/lfy. >vT 

MA PA HUMAN 


P78559 


homo sapien 




9 S R 
z 3 0 




. 4 


lolb 


X 


PzUU MYLtjhi 


Q49429 


mycoplasma 


9 £ 
Z D 


9 ^"7 c: 
ZD / . j 


A 

4 . 


. J 


1 / O 1 


1 


AK12 HUMAN 


Q02952 


homo sapien 


27 


9 R 

3 / . 3 


A 
*± . 


0 


Z / 0 O 


X 


np /"it t DAT 

FLjLV RA1 


Q9erb4 


rattus norv 


2 8 


Z JJ 


A 

4 , 




1 1 on 

x x 0 y 


"1 

X 


idHb YEAbl 


P47035 


sac char omyc 


9 9 

Zj J 


9^9 
Z 3 Z 


A 

4 , 


. J 


/I *3 "7*7 


X 


A "KTT/"^ TTT TH/T7\ JiT 

ANK3 HUMAN 


Q12955 


homo sapien 


■J u 


9 r n r 

ZDU . D 


4 . 


. Z 


i q n 
lo bl 


1 


MAP2 RAT 


P15146 


rattus norv 


J -L 


O/ Q C 
J . D 


A 

4 . 


. z 


Xoz / 


X 


MAPz HUMAN 


P11137 


homo sapien 


7 9 
-j Z 


9 A Q 
Z4 C 


4 . 


, z 


34z 1 


1 


TEGU HSVEB 


P28955 


equine herp 


33 


944 

Z rt *± 


ft . 


1 

, X 


1 ico 


X 


nnr\/i ttt TK/r a m 

oKD4 HUMAN 


060885 


homo sapien 


^ 4 

3 ft 


94 9 
Z^t Z 


4 . 


1 


1 "7 O A 


1 


USOl YE1AST 


P25386 


saccharomyc 


35 


9^9 
i J J 


4 . 


0 


1 "744 
X 1 44 


1 
X 


1AJMA AhNijA 


Q01550 


xenopus lae 




9 ^R R 

JO . J 


4 . 


0 


14 11 
±4 X X 


X 


ILUr HUMAN 


Q13428 


homo sapien 


3 7 


9 7 R 

Z. 3 O . 3 


4 . 


0 


CO 
JJDO 


X 


PCjCV muuse 


Q62059 


mus musculu 


38 


235.5 


4 . 


0 


X / Z 0 


"1 

X 


A TIVII T_TT TIM A T\l 

A1MX HUMAN 


Q9y4kl 


homo sapien 


39 


233.5 


3 . 


9 


1087 


1 


NFH_MOUSE 


P19246 


mus musculu 


40 


232 


3 . 


9 


1140 


1 


YM96 YEAST 


Q04893 


saccharomyc 


41 


131 


3I9N: 


:43QS6 


1 


KI67_HUMAN 


P46013 


homo sapien 


42 


229.5 


3 . 


9 


1367 


1 


AM YH_ YEAST 


P08640 


saccharomyc 


43 


228 .5 


3 . 


9 


1306 


1 


MSB2_YEAST 


P32334 


saccharomyc 


44 


226.5 


3. 


8 


1157 


1 


BBC1_YEAST 


P47068 


saccharomyc 


45 


226 


3 . 


8 


1379 


1 


YFF9_SCHP0 


014066 


schizosacch 



ALIGNMENTS 



RESULT 1 
RTN4_HUMAN 

ID RTN4_HUMAN STANDARD; PRT; 1192 AA . 

AC Q9NQC3; 094962; Q9BXG5; Q9H212; Q9H3I3; Q9UQ42; Q9Y293; Q9Y2Y7; 

AC Q9Y5U6 ; 

DT 28-FEB-2003 (Rel . 41, Created) 

DT 28-FEB-2003 (Rel. 41, Last sequence update) 

DT 15-SEP-2003 (Rel. 42, Last annotation update) 

DE Reticulon 4 (Neurite outgrowth inhibitor) (Nogo protein) (Foocen) 

DE (Neurogndo ¥r i5g 5S g f g^ c M otein > (QgP^ } (^ugo§iid ps ^n fi) specif ic 

DE protein C homolog) (RTN- * 

GN RTN 4 OR NOGO OR ASY OR KIAA0886 . 

OS Homo sapiens (Human) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

OX NCBI JTaxID=9606 ; 

RN [1] 

RP SEQUENCE FROM N.A. (ISOFORMS 1; 2 AND 3) . 

RX MEDLINE=20129242; PubMed=10667780 ; 

RA Prinjha R., Moore S.E., Vinson M., Blake S., Morrow R., Christie G 

RA Michalovich D. , Simmons D.L., Walsh F.S.; 

RT "Inhibitor of neurite outgrowth in humans."; 

RL Nature 403:383-384(2000). 

RN [2] 



RP SEQUENCE FROM N.A. (ISOFORMS 1 AND 2) . 

RC TISSUE=Brain; 

RX MEDLINE=2 10106 96; PubMed=111263 6 0 ; 

RA Tagami S., Eguchi Y., Kinoshita M . , Takeda M . , Tsuj imoto Y. ; 

RT "A novel protein, RTN-XS, interacts with both Bel -XL and Bel -2 on 

RT endoplasmic reticulum and reduces their anti-apoptotic activity."; 

RL Oncogene 19:5736-5746(2000). 

RN [3] 

RP SEQUENCE FROM N.A. (ISOFORMS 1; 2 AND 3) . 

RX MEDLINE=20237542; PubMed=10773 68 0 ; 

RA Yang J., Yu L., Bi A.D., Zhao S.-Y.; 

RT "Assignment of the human reticulon 4 gene (RTN4) to chromosome 

RT 2pl4-->2pl3 by radiation hybrid mapping."; 

RL Cytogenet. Cell Genet. 88:101-102(2000). 
RN [4] 

RP SEQUENCE FROM N.A. (ISOFORM 4) . 

RA Jin W. -L. , Ju G. ; 

RT "Developmentally-regulated alternative splicing in a novel Nogo-A."; 

RL Submitted (NOV-2 000) to the EMBL/ GenBank/DDB J databases. 
RN [5] 

RP SEQUENCE FROM N.A. (ISOFORMS 2 AND 3) . 

RC TISSUE-Placenta, and Skeletal muscle; 

RA Ito T., Schwartz S.M.; 

RT "Cloning of a member of the reticulon gene family in human."; 

RL Submitted (FEB-1999) to the EMBL / GenBank / DDB J databases: 
RN [6] 

RP SEQUENCE FROM N.A. (ISOFORM 2) . 

RC TISSUE-Fibroblast; 

RA Yutsudo M. ; 

RT "Isolation of a cell death- inducing gene."; 

RL Submitted (JUN-1998) to the EMBL / GenBank / DDB J databases. 

RN [7] 

RP SEQUENCE FROM N.A. (ISOFORM 3) . 

RC TISSUE=Pituitary; 

RA Song H. , Peng Y. , Zhou J., Huang Q., Dai M., Mao Y.M. , Yu Y., Xu X., 

RA Luo B . , Hu R . , Chen J . ; 

RT "Human neuroendocrine-specif ic protein C (NSP) homolog gene."; 

RL Submitted (JUL- 1998) to the EMBL/ GenBank/DDB J databases. 

RN [8] 

RP SEQUENCE FROM N.A. (ISOFORM 3) . 

RA Gu J.R., Wan D.F., Zhao X.T., Zhou X.M., Jiang H.Q., Zhang P.P., 

RA Qin W.X., Huang Y. , Qiu X.K., Qian L.F., He L.P., Li H.N., Yu Y. , 

RA Yu J., Han L . H . ; 

RT "Novel human cDNA clone with function of inhibiting cancer cell 

RT growth . " ; 

RL Submitted (AUG-1999) to the EMBL/ GenBank/DDBJ databases. 

RN [9] 

RP SEQUENCE FROM N.A. (ISOFORM 1) . 

RC TISSUE-Brain; 

RX MEDLINE=99156230; PubMed=10 04 8485 ; 

RA Nagase T. , Ishikawa K.-I., Suyama M., Kikuno R . , Hirosawa M. , 

RA Miyajima N . , Tanaka A. , Kotani H., Nomura N. , Ohara 0. ; 

RT "Prediction of the coding sequences of unidentified human genes. XII. 

RT The complete sequences of 100 new cDNA clones from brain which code 

RT for large proteins in vitro."; 

RL DNA Res. 5:355-364(1998). 

RN [10] 



RP SEQUENCE FROM N.A. (ISOFORMS 2 AND 3) . 

RC TISSUE=Brain, Pancreas, Placenta, and Skeletal muscle; 

RX MEDLINE=22388257; PubMed-124 77 932 ; 

RA Strausberg R.L., Feingold E.A., Grouse L.H., Derge J.G., 

RA Klausner R.D. , Collins F.S., Wagner L. , Shenmen CM., Schuler G.D., 

RA Altschul S.F., Zeeberg B. # Buetow K.H., Schaefer C.F., Bhat N.K. , 

RA Hopkins R.F., Jordan H., Moore T. , Max S.I., Wang J., Hsieh F . , 

RA Diatchenko L. , Marusina K. , Farmer A. A. , Rubin G.M., Hong L., 

RA Stapleton m., Soares M.B., Bonaldo M.F., Casavant T.L., Scheetz T.E., 

RA Brownstein M.J., Usdin T.B., Toshiyuki S., Carninci P., Prange C. , 

RA Raha S.S., Loquellano N.A. , Peters G.J., Abramson R.D., Mullahy S.J., 

RA Bosak S.A., McEwan P.J., McKernan K. J. , Malek J. A., Gunaratne P.H., 

RA Richards S., Worley K.C., Hale S., Garcia A.M., Gay L.J., Hulyk S.W., 

RA Villalon D.K. , Muzny D.M.| rgren E.J., Lu X., Gibbs R.A., 

RA Fahey J . , Helton E . , Ketteman M. , Madan A., Rodrigues S., Sanchez A., 

RA Whiting M. , Madan A., Young A.C., Shevchenko Y. , Bouffard G.G., 

RA Blakesley R.W., Touchman J.W., Green E.D., Dickson M.C., 

RA Rodriguez A.C., Grimwood J., Schmutz J., Myers R.M. , 

RA Butterfield Y.S.N. , Krzywinski M.I., Skalska U. , Smailus D.E., 

RA Schnerch A., Schein J.E., Jones S.J.M., Marra M.A. ; 

RT "Generation and initial analysis of more than 15,000 full-length 

RT human and mouse cDNA sequences . " ; 

RL Proc. Natl. Acad. Sci . U.S.A. 99:16899-16903(2002). 

RN [11] 

RP SEQUENCE FROM N.A. (ISOFORM 3) . 

RX MEDLINE=20499367 ; PubMed=ll 042 152 ; 

RA Zhang Q.-H., Ye M. , Wu X.-Y., Ren S.-X., Zhao M. , Zhao C.-J., Fu G. , 

RA ShenY., Fan H.-Y., LuG., Zhong M. , XuX.-R., HanZ.-G., Zhang J . -W . , 

RA Tao J., Huang Q.-H., Zhou J., Hu G.-X., Gu J . , Chen S.-J., Chen Z . ; 

RT "Cloning and functional analysis of cDNAs with open reading frames for 

RT 300 previously undefined genes expressed in CD34+ hematopoietic 

RT stem/progenitor cells."; 

RL Genome Res. 10:1546-1560(2000). 

RN [12] 

RP SEQUENCE OF 482-1192 FROM N.A. (ISOFORM 1/4) . 

RC TISSUE=Brain; 

RA Mao Y.M., Xie Y. , Zheng Z.H.; 

RL Submitted (MAY-1998) to the EMBL / GenBank / DDB J databases. 

RN [13] 

RP SEQUENCE OF 186-1192 FROM N.A. (ISOFORM 1) . 

RC TISSUE=Testis; 

RA Sha J . H . , Zhou Z . M . , Li J.M.; 

RL Submitted (JAN-2001) to the EMBL/ GenBank / DDB J databases. 

RN [14] 

RP TOPOLOGY . 

RC TISSUE=Brain ; 

RX MEDLINE-20129259; PubMed=106677 97 ; 

RA GrandPre T., Nakamura F. , Vartanian T. , Strittmatter S.M.; 

RT "Identification of the Nogo inhibitor of axon regeneration as a 

RT Reticulon protein."; 

RL Nature 403:439-444(2000). 

RN [15] 

RP FUNCTION. 

RC TISSUE=Brain ; 

RX =21069055; PubMed=11201742 ; 1412 

RA Fournier A.E., Grandpre T. , Strittmatter S.M.; 

RT "Identification of a receptor mediating Nogo-66 inhibition of axonal 



RT regeneration. " ; 

RL Nature 409:341-346(2001). 

RN [16] 

RP REVIEW. 

RX MEDLINE=21888956; PubMed=l 18 91768 ; 

RA Ng C.E.L., Tang B . L . ; 

RT "Nogos and the Nogo-66 receptor: factors inhibiting CNS neuron 

RT regeneration. " ; 

RL J. Neurosci. Res. 67:559-565(2002). 

CC -!- FUNCTION: Potent neurite outgrowth inhibitor which may also help 

CC block the regeneration of the nervous central system in adults. 

CC Isoform 2 reduces the anti -apoptotic activity of Bcl-xl and Bel -2. 

CC This is likely consecutive to their change in subcellular 

CC location, from the mitochondria to the endoplasmic reticulum, 

CC after binding and sequestration. 

CC SUBUNIT: Binds to RTN4R. Interacts with Bcl-xl and Bel -2. 

CC -!- SUBCELLULAR LOCATION: Integral membrane protein. Endoplasmic 

CC reticulum. Anchored to the membrane of the endoplasmic reticulum 

CC through 2 putative transmembrane domains. 

CC -!- ALTERNATIVE PRODUCTS: 

CC Event =Alternative splicing; Named isoforms=4; 

CC Name«l; Synonyms=RTN 4A, Nogo~A, RTN-xL; 

CC IsoId=Q9NQC3-l; Sequence=Displayed ; 

CC Name=2; Synonyms=RTN 4B, Nogo-B, RTN-xS, Foocen-M; 

CC IsoId=Q9NQC3-2; Sequence=VSP_005655 ; 

CC Name=3; Synonyms-RTN 4C, Nogo-C, Foocen-S; 

CC IsoId=Q9NQC3-3; Sequence=VSP_005652 , VSP_005653; 

CC Name=4; 

CC IsoId=Q9NQC3-4; Sequence=VSP__005654 ; 

CC -!- TISSUE SPECIFICITY: Isoform 1 is specifically expressed in brain 
CC and testis and weakly in heart and skeletal muscle. Isoform 2 is 

CC widely expressed excepted for the liver. Isoform 3 is expressed in 

CC brain, skeletal muscle and adipocytes. Isoform 4 is testis- 

CC specific. 

CC -!- SIMILARITY: Contains 1 reticulon domain. 

CC -!- CAUTION: Ref.ll sequence differs from that shown due to 

CC frameshifts in positions 1149 and 1156. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormat ics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib. ch) . 

CC 

DR EMBL; AJ251383; CAB99248.1; -. 

DR EMBL; AJ251384; CAB99249.1; 

DR EMBL; AJ251385; CAB99250.1; -. 

DR EMBL; AB040462; BAB18927.1; -. 

DR EMBL; AB040463; BAB18928.1; 

DR EMBL; AF148537; AAG12176.1; -. 

DR EMBL; AF148538; AAG12177.1; -. 

DR EMBL; AF087901; AAG12205.1; 

DR EMBL; AF320999; AAG40878.1; 

DR EMBL; AF132047; AAD31021.1; -. 

DR EMBL; AF132048; AAD31022.1; 



DR EMBL; AB015639; BAA83712.1; 

DR EMBL; AF077050; AAD27783.1; -. 

DR EMBL; AF177332; AAG17976.1; -. 

DR EMBL; AB020693; BAA74909.1; -. 

DR EMBL; BC001035; AAH01035.1; -. 

DR EMBL; BC007109; AAH07109.1; 

DR EMBL; BC014366; AAH14366.1; 

Query Match 98.2%; Score 5815; DB 1; Length 1192; 

Best Local Similarity 97.2%; Pred. No. 2.2e-216; 

Matches 1160; Conservative 4; Mismatches 13; Indels 16; Gaps 3; 
Qy 1 MEDLDQSPLVSSSDSPPRPQPAFKYQFVREPEDEEEEEEEEEEDEDEDLEELEVLERKPA 60 ■ 

IMIIIIIIIIIIMIIIIIIIIilllllllllllMI Mill! Illlllli lllllll! 

Db 1 MEDLDQSPLVSSSDSPPRPQPAFKYQFVREPEDEEEEEEEEEEDEDEDLEELEVLERKPA 60 

Qy 61 AGLSAAPVPTAPAAGAPLMDFGNDFVPPAPRGPLPAAPPVAPERQPSWDPSPVSSTVPAP 120 

I I I I I I I I M M I M I I I I I I I I M M I I 

Db 61 AGLSAAPVPTAPAAGAPLMDFGNDFVPPAPRGPLPAAPPVAPERQPSWDPSPVSSTVPAP 120 

Qy 121 SPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPVWTPPAPAPAAPPSTPAAPKRRG 18 0 

MMIIIIIIMIMII IMIIIilllll llllll III IIIIIMIIMIMIMIIII 

Db 121 SPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPVWTPPAPAPAAPPSTPAAPKRRG 180 

QY 181 SSGA WXXXXKIMDLKEQPGNTISAGQEDFPSVLLETAASXPSLSP 226 

IN: h : I I I I I I I I I I | | | | | || I I I I I I I I I I I 

Db 181 SSGSVDETLFALPAASEPVIRSSAENMDLKEQPGNTISAGQEDFPSVLLETAASLPSLSP 240 

Qy 227 LSAASFKEHEYLGNLSTVLPTEGTLQENVSEASKEVSEKAKTLLIDRDLTEFSELEYSEM 286 

°b 241 LSAASFKEHEYLGNLSTVLPTEGTLQENVSEASKEVSEKAKTLLIDRDLTEFSELEYSEM 3 00 

Qy 287 GSSFSVSPKAESAVIVANPREEIIVKNKDEEEKLVSNNILHXQQELPTALTKLVKEDEW 346 

Db 301 GSSFSVSPKAESAVIVANPREEI I VKNKDEEEKLVSNNILHNQQELPTALTKLVKEDEW 360 

Qy 347 SSEKAKDSFNEKRVAVEAPMREEYADFKPFERVWEVKDSKEDSDMLAAGGKIESNLESKV 406 

Db 361 SSEKAKDSFNEKRVAVEAPMREEYADFKPFERVWEVKDSKEDSDMLAAGGKIESNLESKV 420 

Qy 4 07 DKKCFADSLEQTNHEKDSESSNDDTSFPSTPEGIKDRSGAYITCAPFNPAATESIATNIF 4 66 

1 1 1 f 1 1 1 1 1 1 1 1 1 m 1 1 1 1 r 1 1 f r 1 1 1 j 1 1 1 1 1 1 1 1 r 1 1 1 1 1 1 1 f 1 1 1 1 1 1 1 r r i i 1 1 1 1 

Db 421 DKKCFADSLEQTNHEKDSESSNDDTSFPSTPEGI KDRSGAYI TCAPFNPAATES I ATNI F 480 

Qy 4 67 PLLEDPTSENXTDEKKI EEKKAQI VTEKNTSTKTSNPFFVAAQDSETDYVTTDNLTKVTE 526 

Ml IMMI > I I ' U U IIIIIIIIIIMIIIIIMII 

D] 3 481 PLLGDPTSENKTDEKKI EEKKAQI VTEKNTSTKTSNPFLVAAQDSETDYVTTDNLTKVTE 540 

Qy 527 EWANMPEGLTPDLVQEACESELNEVTGTKIAYETKMDLVQTSEVMQESLYPAAQLCPSF 586 

1 1 1 1 ) 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ( 1 1 1 f 1 1 r j i i j 1 1 1 1 1 r r j j 1 1 1 j 1 1 1 1 1 1 r i r i r 1 1 1 1 1 

Db 541 EWANMPEGLTPDLVQEACESELMEVTGTKIAYETKMDLVQTSEVMQESLYPAAQLCPSF 600 

Qy 587 EESEATPSPVLPDIVMEAPLNSAVPSAGASVIQPSSSPLEASSVNYESIKHEPENPPPYE 646 

IMIIIIIIIIIMIIIMIIIIIIIIIIIIIIIIIIIMMMIIIIIIIIIIIIMM 

Db 601 EESEATPSPVLPDIVMEAPLNSAVPSAGASVIQPSSSPLEASSVNYESIKHEPENPPPYE 660 

Qy 647 EAMSVSL-KVSGIKEEIKEPENINAALQETEAPYISIACDLIKETKLSAEPAPDFSDYSE 705 



Mlllll I I I I I I M.I I U:U Ml I Ml I I I I 

EAMSVSLKKVSGIKEEIKEPENINAALQETEAPYISIACDLIKETKLSAEPAPDFSDYSE 72 0 



Db 


661 


EAMSVSLKKVSGI KEEI KEPENINAALQETEAPYI S I ACDLI KETKLSAEPAPDFSDYSE 


720 


Qy 


706 


MAKVEQPVPDHSELVEDSSPDSEPVDLFSDDS I PDVPQKQDETVMLVKESLTETSFESMI 

Ml III Ml MM IIIIIMI IIIIIIIMII III II III llllllllllllllllllll 

MAKVEQPVPDHSELVEDSSPDSEPVDLFSDDSIPDVPQKQDETVMLVKESLTETSFESMI 


765 


Db 


721 


780 


Qy 


766 


EYENKEKLSALPPEGGKPYLESFKLSLDNTKDTLLPDEVSTLSKKEKI PLQMEELSTAVY 


825 


Db 


781 


IMIM III MMIIIIMMIIIIIIIIIMIM Mlllll llllll MMMM MM 

EYENKEKLSALPPEGGKPYLESFKLSLDNTKDTLLPDEVSTLSKKEKI PLQMEELSTAVY 


840 


Qy 


826 


SNDDLFISKEAQIRETETFSDSSPIEI IDEFPTLISSKTDSFSKLAREYTDLEVSHKSEI 


885 


Db 


841 


IIIIIMIIIIIIIIIMIMIIIIMIIMIIMIIIIIIIIIMIIIIMIIIIIIM 

SNDDLFISKEAQIRETETFSDSSPIEI IDEFPTLISSKTDSFSKLAREYTDLEVSHKSEI 


900 


Qy 


886 


ANAPDGAGSLPCTELPHDLSLKNIQPKVEEKISFSDDFSKNGSATSKVLLLPPDVSALGH 


945 


Db 


901 


MIIIIMIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIMMIIIIIIIIII 

ANAPDGAGSLPCTELPHDLSLKNIQPKVEEKISFSDDFSKNGSATSKVLLLPPDVSALA- 


959 


Qy 


946 


TQAEIESI VKPKVLEKEAEKKLPSDTEKEDRSPSAIFSADLGKTSVVDLLYWRDIKKTGV 


1005 


Db 


960 


1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 II 1 1 1 1 | | | | || | | | | | | | | : | | | || | | | | | M | | | | | | | 
TQAEI ES I VKPKVLVKEAEKKLPSDTEKEDRS PSAI FSAELSKTS VVDLLYWRDI KKTGV 


1019 


Qy 


1006 


VFGASLFLLLSLTVFS I VSVTAYI ALALLSVTI SFRI YKGVI QAI QKSDEGHPFRAYLES 


1065 


Db 


1 090 


IMIIIIIIIIIIIIIMIIIIIIIIIMIIIIIIIIIIIIIIMMIIIIMIIIIIII 

vt*Kji\tDlje J_iJ_tJ_fbijI vfbl Vb V iAi lALALLb V 1 lbrRI YKGVI QAI QKSDEGHPFRAYLES 


1079 


Qy 


1066 


EVAI SEELVQKYSNSALGHVNCTI KELRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGLT 


1125 


Db 


1080 


1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ! 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 N 1 1 1 1 1 

EVAI SEELVQKYSNSALGHVNCTI KELRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGLT 


1139 


Qy 


1126 


LLI LALI SLFSVPVI YERHQAQI DHYLGLANKNVKDAMAKI QAKI PGLKRKAE 1178 

IMIIIIIIIIIIIIIIIMIIIIIIIIIMIIIIIMIIIIIIIIIIIIIII 




Db 


1140 


LLI LAL I SLFSVPVI YERHQAQ I DHYLGLANKNVKDAMAKI QAKI PGLKRKAE 1192 





RESULT 2 
RTN4_RAT 

ID RTN4_RAT STANDARD; PRT; 1163 AA. 

AC Q9JK11; Q9JK10; Q9R0D9; Q9WUE9 ; Q9WUF0; 

DT 28-FEB-2003 (Rel . 41, Created) 

DT 28-FEB-2003 (Rel. 41, Last sequence update) 

DT 15-SEP-2003 (Rel. 42, Last annotation update) 

DE Reticulon 4 (Neurite outgrowth inhibitor) (Nogo protein) (Foocen) 

DE (Glut4 vesicle 20 kDa protein) . 

GN RTN4 OR NOGO. 

OS Rattus norvegicus (Rat) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Rodent ia ; Sciurognathi ; Muridae; Murinae; Rattus. 

OX NCBI_TaxID-10116; 

RN [1] 

RP SEQUENCE FROM N.A. (ISOFORM 3), AND PARTIAL SEQUENCE. 

RC STRAIN=Sprague-Dawley; TISSUE=Adipocyte ; 

RX MEDLINE-99249816; PubMed=l 023 1557 ; 

RA Morris N.J., Ross S.A., Neveu J.M. , Lane W.S., Lienhard G.E.; 

RT "Cloning and characterization of a 22 kDa protein from rat adipocytes: 

RT a new member of the reticulon family."; 



RL Biochim. Biophys . Acta 1450:68-76(1999). 

RN [2] 

RP SEQUENCE FROM N.A. (ISOFORMS 1; 2 AND 3) . 

RX MEDLINE=20129258; PubMed=10667796 ; 

RA Chen M.S., Huber A.B., Van der Haar M.E., Frank M., Schnell L. , 

RA Spillmann A. A. , Christ F., Schwab M.E.; 

RT "Nogo-A is a myelin-associated neurite outgrowth inhibitor and an 

RT antigen for monoclonal antibody IN-1."; 

RL Nature 403:434-439(2000). 

RN [3] 

RP SEQUENCE FROM N.A. (ISOFORMS 2 AND 4) . 

RC STRAIN-Wistar Kyoto; TISSUE=Vascular smooth muscle; 

RA I to T., Schwartz S.M.; 

RT "Cloning of a member of the reticulon gene family in rat: one of two 

RT minor splice variants."; 

RL Submitted (FEB-1999) to the EMBL/ GenBank / DDB J databases. 

RN [4] 

RP FUNCTION. 

RX MEDLINE=22033691; PubMed-12 03 7567 ; 

RA GrandPre T. , Li S., Strittmatter S.M.; 

RT "Nogo-66 receptor antagonist peptide promotes axonal regeneration . " ; 

RL Nature 417:547-551(2 002). 

CC -!- FUNCTION: Potent neurite outgrowth inhibitor which may also help 
CC block the regeneration of the nervous central system in adults (By 

CC similarity) . 

CC -!- SUBUNIT: Binds to RTN4 R . Interacts with Bcl-xl and Bel -2 (By 
CC similarity) . 

CC -!- SUBCELLULAR LOCATION: Integral membrane protein. Anchored to the 
CC membrane of the endoplasmic reticulum through 2 putative 

CC transmembrane domains (By similarity) . 

CC -!- ALTERNATIVE PRODUCTS: 

CC Event =Alternative splicing; Named isoforms-4; 

CC Name=l; Synonyms =Nogo -A , NI -220-250; 

CC IsoId=Q9JKll-l; Sequence=Displayed; 

CC Name=2; Synonyms -Nogo-B, Foocen-Ml; 

CC IsoId=Q9JKll-2; Sequence=VSP_005658; 

CC Name=3; Synonyms =Nogo - C , VP2 0; 

CC IsoId=Q9JKll-3; Sequence=VSP_005656 , VSP_005657; 

CC Name=4 ; Synonyms -Foocen-M2 ; 

CC IsoId=Q9JKll-4; Sequence=VSP_005659; 

CC -!- TISSUE SPECIFICITY: Isoforms 1, 2 and 3 are present in optic 
CC nerve, spinal cord and cerebral cortex. Isoforms 1 and 2 are 

CC present in dorsal root ganglion, sciatic nerve and PC12 cells 

CC after longer exposure. Isoforms 2 and 3 are detected in kidney, 

CC cartilage, skin, lung and spleen. Isoform 3 is expressed at high 

CC level in skeletal muscle. In adult animals isoform 1 is expressed 

CC mainly in the nervous system. 

CC -!- SIMILARITY: Contains 1 reticulon domain. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinformatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib. ch) . 

CC 



DR 


EMBL; AF051335 


- AAF01564. 


1 




JJK 


TTTv/TD T 7\ TO A O Ci 1~~ 1 

hiMbL; AJz4zy61 


• CAB71027. 


1 




nn 

JJK 


tPlUfDT . A TO A O CI (Z O 

JiMJb J_i / A J 2 4 z y 6 Z 


■ CAB71028. 


1, 




DR 


EMBL; AJ242 963 


• CAB71029. 


1, 




DR 


EMBL; AF132045 


• AAD31019. 


1, 




DR 


EMBL; AF132046 


• AAD3102 0. 


1, 




DR 


GO; GO: 0030176 


C 


endoplasmic reticulum membrane, intrinsic 


DR 


GO; GO: 0005635 


C: nuclear membrane; ISS. 


DR 


GO; 00:0005515, 


F 


protein binding activity; ISS. 


DR 


GO; GO:0019987, 


P 


negative 


regulation of anti-apoptosis; ISS 


DR 


GO; 00:0030517, 


P 


negative 


regulation of axon extension; ISS 


DR 


InterPro ; 


IPR0033* 


38; Reticulon. 


DR 


Pfam; PF02453; 


Reticulon; 


1. 


DR 


PROSITE; 


PS50845; 


RETICULON; 1. 


KW 


Endoplasmic reticulum; Alternative splicing; Transmembrane. 


FT 


DOMAIN 


1 


989 




CYTOPLASMIC (Potential) . 


FT 


TRANSMEM 


990 


1010 




POTENTIAL . 


r 1 


DOMAIN 


1011 


1104 




LUMENAL (Potential) . 


FT 


TRANSMEM 


1105 


1125 




POTENTIAL . 


FT 


DOMAIN 


1126 


1163 




CYTOPLASMIC (Potential) . 


FT 


DOMAIN 


976 


1163 




RETICULON. 


t 1 


DOMAIN 


33 


46 




POLY-GLU. 


r 1 


DOMAIN 


73 


76 




POLY-ALA . 


FT 


DOMAIN 


140 


145 




POLY- PRO. 


FT 


VARSPLIC 


1 


964 




Missing (in isoform 3) . 


FT 












/FTId=VSP_005656 . 


FT 


VARSPLIC 


965 


975 




AVLSAELS KTS -> MDGQKKHWKDK (in 


FT 












3) . 


FT 












/FTId=VSP_005657. 


FT 


VARSPLIC 


173 


975 




Missing (in isoform 2) . 


FT 












/FTId=VSP_005658 . 


FT 


VARSPLIC 


192 


975 




Missing (in isoform 4) . 


FT 












/FTId=VSP_005659 . 


FT 


CONFLICT 


1130 


1131 




MISSING (IN REF. 3; AAD31020) . 


SQ 


SEQUENCE 


1163 


AA; 1263 


86 


MW; 8CB894B09E94F0B6 CRC64 ; 



Query Match 72.5%; Score 4296.5; DB 1; Length 1163; 

Best Local Similarity 74.0%; Pred. No. 4.2e-158; 

Matches 885; Conservative 104; Mismatches 156; Indels 51; Gaps 19; 
Qy 1 MEDLDQSPLVSSS-DSPPRPQPAFKYQFVREPEDEE-EEEEEEEEDEDEDLEELEVLERK 58 



Db 




Qy 




Db 



Qy 




Db 



Qy 



179 RGSSG AWXXXXKIMDLKEQPGNTISAGQEDFPSVLLETAASXPSL 224 



Db 



167 RGSGSVDETLFALPAASEPVI PSSAEKIMDLMEQPGNTVSSGQEDFPSVLLETAASLPSL 226 




Qy 



225 SPLSAASFKEHEYLGNLSTVLPTEGTLQENVSEASKEVSEKAKTLLIDRDLTEFSELEYS 284 



II MINIM 

DLAEFSELEYS 286 



Db 


227 


SPLSTVSFKEHGYLGNLSAVSSSEGTIEETLNEASKELPERATNPFVNRDLAEFSELEYS 


286 


Qy 


285 


EMGSSFSVSPKAESAVIVANPREEI IVKNKDEEEKLVSNNILHXQQELPTALTKLVKEDE 


344 


Db 


287 


Mill! Ml llhM 1 : 1 h 1 h : M : I : || : || || | ||l 

EMGSS FKGS P KGESA I LVENTKEEVI VRS KDKED - LVCSAALHS PQES PVG KEDR 


340 


Qy 


345 


WSSEKAKDSFNEKRVAVEAPMREEYADFKPFERVWEVKDSKEDS-DMLAAGGKIESNLE 


403 


Db 


341 


III M 1 III I IIMIIIIIIMIh llllh 1 1 IMM MM 

WSPEKTMDIFNEMQMSWAPVREEYADFKPFEQAWEVKDTYEGSRDVLAA RANVE 


396 


Qy 


404 


SKVDKKCFADSLEQTNHEKDSESSNDDTSFPSTPEGIKDRSGAYITCAPFNPAATESIAT 


463 


Db 


397 


Nihil Mill : MM hi lllllll Ml 1 Illlll 1 Mill 

SKVDRKCLEDSLEQKSLGKDSEGRNEDASFPSTPEPVKDSSRAYITCASFT-SATESTTA 


455 


Qy 


464 


NIFPLLEDPTSENXTDEKKIEEKKAQI VTEKNTSTKTSNPFFVAAQDSETDYVTTDNLTK 523 
1 Illlll 1 1 1 1 1 1 1 1 1 1 1 1 . 1 1 1 1 . 1 1 1 f 1 Illlll II I i i i iiiiii i i 


Db 


456 


' i • 1 I II • 1 II || | | I | | | || Mil MINI 1 : 1 

NTF PLLEDHTS ENKTDEKKI EERKAQ 1 1 TEK- TS P KTSNPFLVAVQDS EAD Y VTTDTLS K 


514 


Qy 


524 


VTEEWANMPEGLTPDLVQEACESELNEVTGTKIAYETKMDLVQTSEVMQESLYPAAQLC 583 


Db 


515 


Ml IMMMMMMIMMIMM MIMIMMMMMM MMIM MM 

VTEAAVSNMPEGLTPDLVQEACESELNEATGTKIAYETKVDLVQTSEAIQESLYPTAQLC 574 


Qy 


584 


PSFEESEATPSPVLPDIVMEAPLNSAVPSAGASVIQPSSSPLEA-SSVNYESIKHEPENP 


642 


Db 


575 


II II M 1 1 1 1 II 1 M 1 1 1 M 1 II 1 1 -MINIMI Mill -Ml Mill 

PSFEEAEATPSPVLPDIVMEAPLNSLLPSAGASVVQPSVSPLEAPPPVSYDSIKLEPENP 




Qy 


643 


PPYEEAMSVSLKVSGIKEEIKEPENINAALQETEAPYISIACDLIKETKLSAEPAPDFSD 


702 


Db 


635 


M M M M h II 1 II llllh MhllllllllMMIIIIIIMI MM MM 

PPYEEAMNVALKALGTKEGIKEPESFNAAVQETEAPYISIACDLIKETKLSTEPSPDFSN 


694 


Qy 


703 


YSEMAKVEQPVPDHSELVEDSSPDSEPVDLFSDDSIPDVPQKQDETVMLVKESLTETSFE 


762 


Db 


695 


MIMI 1 , 1 M 1 1 : 1 ! 1 ! M hi I MM | 1 

YSEIAKFEKSVPEHAELVEDSSPESEPVDLFSDDSI PEVPQTQEEAVMLMKESLTEVS-E 


753 


Qy 


763 


SMI EYENKEKLSALPPEGGKPYLESFKLSLDNTKDTLLPDEVSTLSKKEKI PLQMEELST 


822 


Db 


754 


= = =1 = 111 1 1 M 1 1 1 1 i 1 : : 1 MM IhlMM ||||| M 
TVAQHK- EERLSASPQELGKPYLESFQPNLHSTKDA-ASNDI PTLTKKEKI SLQMEEFNT 


811 


Qy 


823 


AVYSNDDLFI SKEAQI RETETFSDSSP I E I I DEFPTLI SSKTDSFSKLAREYTDLEVSHK 


882 


Db 


812 


IMIIMI III , MM II IMMMIMM 1 

AI YSNDDLLSSKEDKIKESETFSDSSPIEI IDEFPTFVSAKDDS-PKLAKEYTDLEVSDK 


870 


Qy 


883 


SEIANAPDGAGSLPCTELPHDLSLKNIQPKVEEKISFSDDFSKNGSATSKVLLLPPDVSA 


942 


Db 


871 


Mill II MM III III III || ::: MM IMI M MM 
SEIANIQSGADSLPCLELPCDLSFKNI YPK- -DEVHVSDEFSENRSSVSKASI SPSNVSA 


928 


Qy 


943 


LGHTQAEIESIVKPKVLEKEAEKKLPSDTEKEDRSPSAIFSADLGKTSWDLLYWRDIKK 


1002 


Db 


929 


1 1 1 : MM 1 1 MIIMMMMIMM Ih MM i , 1 

L-EPQTEMGSIVKSKSLTKEAEKKLPSDTEKEDRSLSAVLSAELSKTSVVDLLYWRDIKK 


987 


Qy 


1003 


TGWFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRI YKGVIQAIQKSDEGHPFRAY 


1062 


Db 


988 


1 II II 1 II M II M M II 1 II II 1 1 1 II 1 1 II 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 II 1 1 II II II 1 1 

TGWFGASLFLLLSLTVFS I VSVTAYIALALLSVTISFRI YKGVIQAIQKSDEGHPFRAY 


1047 


Qy 


1063 


LESEVAI SEELVQKYSNSALGHVNCTI KELRRLFLVDDLVDSLKFAVLMWVFTYVGALFN 


1122 



IIMMMMIIMIIIMIMM MIMIIIIIIIIIIIIIIIIIMIIMIIIIIII 



Db 



1048 LESEVAI SEELVQKYSNSALGHVNSTI KELRRLFLVDDLVDSLKFAVLMWVFTYVGALFN 1107 



Qy 1123 GLTLLILALISLFSVPVIYERHQAQIDHYLGLANKNVKDAMAKIQAKIPGLKRKAE 1178 

I M I I II I M Ml I I I I MM I MIM I M M III II I I I Ml IM 

Db 1108 GLTLLILALISLFSI PVI YERHQVQIDHYLGLANKSVKDAMAKIQAKI PGLKRKAD 1163 

RESULT 3 
RTN4_MOUSE 

ID RTN4_MOUSE STANDARD; PRT; 199 AA. 

AC Q99P72; Q9CTE3; 

DT 28-FEB-2003 (Rel . 41, Created) 

DT 28-FEB-2003 (Rel. 41, Last sequence update) 

DT 28-FEB-2003 {Rel. 41, Last annotation update) 

DE Reticulon 4 (Neurite outgrowth inhibitor) (Nogo protein) . 

GN RTN4 OR NOGO. 

OS Mus musculus (Mouse) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Mus. 

OX NCBI_TaxID=10090; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=3T3-L1; TISSUE=Adipocyte ; 

RA Coulson A.C., Craggs P.D., Morris N. J. ; 

RT "Mouse vp20/RTN4C cDNA . " ; 

RL Submitted (DEC-2000) to the EMBL/ GenBank/DDB J databases. 

RN [2] 

RP SEQUENCE OF 170-199 FROM N.A. 

RC STRAIN=C57BL/6J; TI SSUE= Embryo ; 

RX MEDLINE=21085660; PubMed=11217851 ; 

RA Kawai J., Shinagawa A., Shibata K. , Yoshino M . , Itoh M., Ishii Y. , 

RA Arakawa T. , Hara A., Fukunishi Y. , Konno H. , Adachi J. , Fukuda S. # 

RA Aizawa K. , Izawa M. , Nishi K. , Kiyosawa H., Kondo S., Yamanaka I., 

RA Saito T., Okazaki Y., Gojobori T. , Bono H. , Kasukawa T. , Saito R., 

RA Kadota K. , Matsuda H.A. , Ashburner M . , Batalov S., Casavant T. , 

RA Fleischmann W., Gaasterland T. , Gissi C. , King B., Kochiwa H. , 

RA Kuehl P., Lewis S., Matsuo Y. , Nikaido I., Pesole G., Quackenbush J. , 

RA Schriml L.M., Staubli F. , Suzuki R. , Tomita M., Wagner L. , Washio T. , 

RA Sakai K. , Okido T. , Furuno M. , Aono H. , Baldarelli R. , Barsh G., 

RA Blake J., Boffelli D. , Bojunga N. , Carninci P., de Bonaldo M.F., 

RA Brownstein M.J., Bult C. , Fletcher C. , Fujita M. , Gariboldi M . , 

RA Gustincich S., Hill D., Hofmann M., Hume D.A. , Kamiya M. , Lee N.H., 

RA Lyons P., Marchionni L . , Mashima J. , Mazzarelli J., Mombaerts P., 

RA Nordone P . , Ring B., Ringwald M. , Rodriguez I., Sakamoto N. , 

RA Sasaki H. , Sato K. , Schoenbach C. , Seya T. , Shibata Y. , Storch K.-F. # 

RA Suzuki H., Toyo-oka K. , Wang K.H., Weitz C. , Whittaker C. # Wilming L. , 

RA Wynshaw-Boris A., Yoshida K. , Hasegawa Y., Kawa j i H. , Kohtsuki S., 

RA Hayashizaki Y.; 

RT "Functional annotation of a full-length mouse cDNA collect ion. " ; 

RL Nature 409:685-690(2001). 

CC -!- FUNCTION: Potent neurite outgrowth inhibitor which may also help 
CC block the regeneration of the nervous central system in adults (By 

CC similarity) . 

CC -!- SUBUNIT: Binds to RTN4R. Interacts with Bcl-xl and Bel -2 (By 
CC similarity) . 

CC -!- SUBCELLULAR LOCATION: Integral membrane protein. Anchored to the 
CC membrane of the endoplasmic reticulum through 2 putative 



cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 

DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
KW 
FT 
FT 
FT 
FT 
FT 
FT 
SQ 



transmembrane domains (By similarity) . 
ALTERNATIVE PRODUCTS: 

Event =Alternative splicing; Named isoforms=l; 

Comment =A number of isoforms may be produced; 
Name=l ; 

IsoId=Q99P72-l; Sequence=Displayed; 
SIMILARITY: Contains 1 reticulon domain. 



This SWISS-PROT entry is copyright. It is produced through a collaboration 
between the Swiss Institute of Bioinf ormat ics and the EMBL outstation - 
the European Bioinf ormat ics Institute. There are no restrictions on its 
use by non-profit institutions as long as its content is in no way 
modified and this statement is not removed. Usage by and for commercial 
entities requires a license agreement (See http://www.isb-sib.ch/announce/ 
or send an email to license@isb-sib . ch) . 

EMBL; AF326337; AAK08076.1; 

EMBL; AK0038 59; -; NOT_ANNOTATED_CDS . 

MGD; MGI: 1915835; Rtn4 . 

GO; GO:0030176; C : endoplasmic reticulum membrane, intrinsic p. . .; ISS. 

GO; GO: 0005783; C : endoplasmic reticulum; IDA. 

GO; GO:0005635; C: nuclear membrane; ISS. 

GO; GO:0005515; Frprotein binding activity; ISS. 

GO; GO: 0019987; P:negative regulation of anti-apoptosis ; ISS . 

GO; GO: 0030517; P:negative regulation of axon extension; ISS . 

InterPro; IPR003388; Reticulon. 

Pfam; PF02453; Reticulon; 1. 

PROSITE; PS50845; RETICULON; 1. 

Endoplasmic reticulum; Alternative splicing; Transmembrane. 



DOMAIN 

TRANSMEM 

DOMAIN 

TRANSMEM 

DOMAIN 

DOMAIN 

SEQUENCE 



1 25 CYTOPLASMIC (Potential) . 

26 50 POTENTIAL. 

51 13 7 LUMENAL (Potential) . 

138 162 POTENTIAL. 

163 199 CYTOPLASMIC (Potential) . 

12 199 RETICULON. 

199 AA; 22466 MW; 07BE5D580059ED9C CRC64 ; 



Query Match 15.4%; Score 915; DB 1; Length 199; 

Best Local Similarity 97.4%; Pred. No. 3.3e-29; 

Matches 186; Conservative 2; Mismatches 3; Indels 0; 



Gaps 



0; 



Qy 


988 


KTSWDLLYWRDI KKTGWFGASLFLLLSLTVFSI VSVTAYIALALLSVTI SFRI YKGVI 


1047 


Db 


9 


> 1 1 1 1 1 1 1 1 1 1 Ml 1 MM ! 1 1! 1 1 1 1 1 1 1 II 1 II I II M 1 1 1 II! || 1 IN 

KDKWDLLYWRD I KKTGWFGASLFLLLSLTVFSI VSVTAYIALALLSVTI SFRI YKGVI 


68 


Qy 


1048 


QAI QKSDEGHPFRAYLESEVAI SEELVQKYSNSALGHVNCTI KELRRLFLVDDLVDSLKF 


1107 


Db 


69 


MIIIIIMIilllllMMIMIIIIIIillMIMM llllllllllllllllllll 

QAI QKSDEGHPFRAYLESEVAI SEELVQKYSNSALGHVNSTI KELRRLFLVDDLVDSLKF 


128 


Qy 


1108 


AVLMWVFTYVGALFNGLTLLI LALI SLFSVPVI YERHQAQI DHYLGLANKNVKDAMAKI Q 

IMIMMIIIMIIIIIIIIIIIIIIIhllllMMIIIIIMIIMhlllllMM 


1167 


Db 


129 


AVLMWVFTYVGALFNGLTLLI LALI SLFS I PVI YERHQAQ I DHYLGLANKS VKDAMAKI Q 


188 


Qy 


1168 


AKI PGLKRKAE 1178 




Db 


189 


1 1 1 1 i I f 1 1 1 1 

AKI PGLKRKAE 199 





RESULT 4 
RTN1 RAT 



ID RTN1_RAT STANDARD / PRT; 777 AA . 

AC Q64548; Q64547; 

DT 16-OCT-2001 (Rel. 40, Created) 

DT 16-OCT-2001 (Rel. 40, Last sequence update) 

DT 15-SEP-2003 (Rel. 42, Last annotation update) 

DE Reticulon 1 (Neuroendocrine-specif ic protein) (S-rex) . 

GN RTN1 OR NSP. 

OS Rattus norvegicus (Rat) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Rattus. 

OX NCBI_TaxID=10116; 

RN [1] 

RP SEQUENCE FROM N.A. (ISOFORMS RTN1-B AND RTN1-S) . 

RC STRAIN=Wistar ; TISSUE-Brain cortex; 

RX MEDLINE=96386034; PubMed=8 7 93 864 ; 

RA Baka I.D., Ninkina N.N., Pinon L.G.P., Adu J., Davies A.M. , 

RA Georgiev G.P., Buchman V.L.; 

RT "Intracellular compartmental izat ion of two differentially spliced s- 

RT rex/NSP mRNAs in neurons . n ; 

RL Mol. Cell. Neurosci. 7:28 9-303(1996). 

CC -!- FUNCTION: MAY BE INVOLVED IN NEUROENDOCRINE SECRETION OR IN 

CC MEMBRANE TRAFFICKING IN NEUROENDOCRINE CELLS. 

CC -!- SUBCELLULAR LOCATION: ENDOPLASMIC RETICULUM MEMBRANE (BY 

CC SIMILARITY) . 

CC -!- ALTERNATIVE PRODUCTS: 

CC Event=Alternative splicing; Named isoforms=2; 

CC Name=RTNl -B ; Synonyms = S - RexB ; 

CC IsoId=Q64548-l; Sequence=Displayed ; 

CC Name=RTNl-S; Synonyms =S -RexS ; 

CC IsoId=Q64548-2; Sequence=VSP_005647 , VSP_005648; 

CC -!- TISSUE SPECIFICITY: EXPRESSED PREDOMINANTLY IN CENTRAL AND 

CC PERIPHERAL NERVOUS SYSTEM OF NEWBORN AND ADULT RATS. LOW LEVELS 

CC HAVE BEEN ALSO DETECTED IN HEART, ADRENAL GLAND AND SPLEEN. 

CC EXPRESSION OF ISOFORM RTN1-B IS RESTRICTED TO PARTICULAR NEURONAL 

CC TYPES . 

CC -!- DEVELOPMENTAL STAGE: DETECTED ON EMBRYONIC DAY E10 IN THE 

CC HINDBRAIN AND IN Ell IN THE FOREBRAIN . DURING THE NEXT 3 EMBRYONIC 

CC DAYS THE LEVELS OF S-REXS INCREASES AND REMAINS STABLE AT E13 IN 

CC THE HINDBRAIN AND AT E14 IN THE FOREBRAIN. THE LEVELS OF S-REXB 

CC DOES NOT CHANGE AS SIGNIFICANTLY DURING DEVELOPMENT OF THE 

CC HINDBRAIN. 

CC -!- SIMILARITY: Contains 1 reticulon domain. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormat ics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib. ch) . 

CC 

DR EMBL ; U17604; AAC53046.1; 

DR EMBL; U17603; AAC53045.1; -. 

DR InterPro; IPR003388; Reticulon. 



DR 


Pfam; PF02453; Reticulon; 


1 . 




DR 


PROSITE; 


PS50845; 


RETICULON; 


1. 




Endoplasmic reticulum; Alternative splicing; Transmembrane. 


FT 


TRANSMEM 


604 


624 




POTENTIAL. 


FT 


TRANSMEM 


727 


747 




POTENTIAL. 


r 1 


DOMAIN 


590 


111 




RETICULON . 


FT 


DOMAIN 


610 


613 




POLY -LEU. 


FT 


VARSPLIC 


1 


569 




Missing (in isoform RTN1-S) . 


FT 










/FTId=VSP_005647. 


FT 


VARSPLIC 


570 


589 




I PGPLGSDLVPPLPFFNKQK -> MQATAD 


FT 










KSQ (in isoform RTN1-S) . 


FT 










/FTId=VSP_005648 . 


SQ 


SEQUENCE 


777 AA; 


83001 




; AF7479C50F28D0AC CRC64 ; 



Query Match 13.4%; Score 791; DB 1; Length 777; 

Best Local Similarity 26.6%; Pred. No. le-23; 

Matches 290; Conservative 129; Mismatches 308; Indels 364; Gaps 39; 

Qy 134 PEDDEPPARPPPPPPASVSPQAE-PVWTPPAPAPAAPPSTPAAPKRRGSSGAWXXXXKI 192 

I : = I I I = I M II II h II 

Db 5 PDLQDEPLSPANPGSQLFGGRGEGEEATPKGARPAQQDGEPAWGS - -GAGAGWS- - 57 

QY 193 MDLKEQPGNTI SAGQEDFPSVLLETA ASXP - SLS PLSAASFKEHE YLGNLSTV 244 

-I II MM h I :| h = h I I ■ I : 

Db 58 SRGLCSGPARSPPVAMETASTGVAAVPDALDHSSSPTLKDGEGACYTSLISDI 110 

Qy 245 L- - PTE GTLQE N VS EAS KE VS EKAKTLL I DRDLTEFS ELE YS EMGS S FS 291 

II I 11= Ih :h : I : II I h I : 

Db 111 CYPPREDSAYFTGILQKENGHITTSESPEELGTPGPS-LPEVPGTEPHGLLSSDSGIEMT 169 

Qy 2 92 VSPKAESAVIVANPREEIIVKNKDEEEKLVSNNILHXQQELPTALTKLVKEDEWSSEKA 351 
Db 170 PAESTEVNKI LADPLDQ 186 

Qy 3 52 KDSFNEKRVAVEAPMREEYADFKPFERVWEVKDSKEDSDMLAAGGKIESNLESKVDKKCF 411 

l : l I I I =1 I I 
Db 187 MKAEACKYIDITRPQEAKGQEEQSPGL 213 

Qy 412 ADSLEQTNHEKDSESSNDDTSFPSTPEGIKDRSGAYITCAPFNPAATES- - IATNIFPLL 469 

= 11 = = h : |||: II h I I hi 
214 EDKDLDFKDKDSEVSTKPEGVH APNQPS PVEGKLI KDNLF 253 

Qy 470 EDPTSENXTDEKKIEEKKAQIVTE- -KNTSTKTSNPFFVAAQDSETDYVTTDNLTKVTEE 527 

hi II h : HI I I h I I - 

Db 254 EESTFAPYIDELSDEQHRMSLVTAPVKITLTEIGPPVMTATHET 297 

Qy 528 WANMPEGLTPDLVQEACESELNEVTGTKIAYETKMDLVQTSEVMQESLYPAAQLCPSFE 587 

: I I I II. I h h 

Db 2 98 IPE KQDL CLKPSPDTVPTVT 317 

Qy 588 ESEA TPS PVLPDI VMEAPLNSAVPSAGASVI QPSSSP LEASSVNYESI KH 637 

I I =111 I : III I : M || ::|h 
Db 318 VSEPEDDSPGSVTP PSSGTEPSAAESQGKGSVSEDELIAAI KEAKGLSYET 3 68 

Qy 638 EPENPPPYEEAMSVSLKVSGIKEEIKE PENI NAALQETEAP Y I S I ACDL I KET 690 



Db 



369 - TES PRP VGQAAD RPKVKARSGLPTI PSSLDQEASSAESGDSEI - -ELVSED 417 



Qy 


691 


Db 


418 


Qy 


751 


Db 


460 


Qy 


811 


Db 


507 


Qy 


868 


Db 


535 


Qy 


928 


Db 


566 


Qy 


988 


Db 


587 


Qy 


1048 


Db 


647 


Qy 


1108 


Db 


707 


Qy 


1168 


Db 


767 



II III II 



III' II = =| h = : 

'PM--KPGV LD AIREETSSRATE 506 



I hi h | : |: : 

-SHQGPVEPDPILSFTPVTLQSRPE- - 534 



I III h I I lh I I- 

-PSSGDGAPVPEPP KSQQQKPEEEAVSS SQSP 565 



= M -I : I h 

AAT EIPGPLGSDLVPPLPFFN 586 



I :|IIIMMhlhllh I II Ml IhlM Ihlll II ' h 

KQKAI DLLYWRDI KQTGI VFGSFLLLLFSLTQFSWS WAYLALAALSATI SFRI YKSVL 

QAIQKSDEGHPFRAYLESEVAISEELVQKYSNSALGHWCTIKELRRLFLVDDLVDSLKF 

M M I I [ h :|:| ! I :|| |: |||| |||||||| 

QAVQKTDEGHPFKAYLELEITLSQEQIQKYTDCLQLYVNSTLKELRRLFLVQDLVDSLKF 

AVLMWVFTYVGALFNGLTLLI LALI SLFSVPVI YERHQAQI DHYLGLANKNVKDAMAKI Q 
Mllh IIIIMIMIII|::|::|:h:||:| :||||:| |||| :: :|||| 
AVLMWLLTYVGALFNGLTLLLMAWSMFTLPVVYVKHQAQVDQYLGLVRTHINTWAKIQ 

AKI PGLKRKAE 1178 

Mill II II 



RESULT 5 
RTN1_HUMAN 

ID RTNl_HUMAJSh STANDARD; PRT; 776 AA. 

AC Q16799; Q16800; Q16801; 

DT 16-OCT-2001 (Rel . 40, Created) 

DT 16-OCT-2001 (Rel. 40, Last sequence update) 

DT 15-SEP-2003 (Rel. 42, Last annotation update) 

DE Reticulon 1 (Neuroendocrine-specif ic protein) . 

GN RTN1 OR NSP. 

OS Homo sapiens (Human) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 
OX NCBI_TaxID=9606 ; 
RN [1] 

RP SEQUENCE FROM N.A. (ISOFORMS RTN1-A; RTN1-B AND RTN1-C) . 

RC TISSUE=Lung carcinoma; 

RX MEDLINE=93293865; PubMed=7685762 ; 

RA Roebroek A.J.M., Van de Velde H.J.K., Van Bokhoven A., Broers J.L.V., 
RA Ramaekers F.C.S., Van de Ven W.J.M.; 



RT "Cloning and expression of alternative transcripts of a novel 

RT neuroendocrine-specif ic gene and identification of its 135-kDa 

RT translational product . " ; 

RL J . Biol. Chem. 268:13439-13447(1993). 

RN [2] 

RP ALTERNATIVE SPLICING . 

RX MEDLINE=9642 9995; PubMed=8 833 14 5 ; 

RA Roebroek A.J.M., Ayoubi T.A.Y., Van de Velde H.J.K. # 

RA Schoenmakers E.F.P.M., Pauli I.G.L., Van de Ven W.J.M.; 

RT "Genomic organization of the human NSP gene, prototype of a novel gene 

RT family encoding ret iculons . " ; 

RL Genomics 32:191-199(1996). 

RN [3] 

RP TISSUE SPECIFICITY. 

RX MEDLINE=98228245; PubMed=95604 66 ; 

RA Hens J . , Nuydens R. , Geerts H. , Senden N.H., Van de Ven W.J.M., 

RA Roebroek A.J., van de Velde H.J., Ramaekers F.C., Broers J.L.; 

RT "Neuronal differentiation is accompanied by NSP-C expression."; 

RL Cell Tissue Res. 292:229-237(1998). 

CC -!- FUNCTION: MAY BE INVOLVED IN NEUROENDOCRINE SECRETION OR IN 

CC MEMBRANE TRAFFICKING IN NEUROENDOCRINE CELLS . 

CC -!- SUBCELLULAR LOCATION : Endoplasmic reticulum membrane. 

CC -!- ALTERNATIVE PRODUCTS: 

CC Event ^Alternative splicing; Named isoforms-3; 

CC Name=RTNl-A; Synonyms = NSP -A; 

CC IsoId=Q16799-l; Sequence=Displayed; 

CC Name=RTNl-B; Synonyms =NSP-B; 

CC IsoId=Q16799-2; Sequence=VSP_0 05644 ; 

CC Name=RTNl-C; Synonyms = NSP-C ; 

CC IsoId=Q16799-3; Sequence=VSP_005645 , VSP_005646; 

CC -!- TISSUE SPECIFICITY: EXPRESSED IN NEURAL AND NEUROENDOCRINE TISSUES 
CC AND CELL CULTURES DERIVED THEREFROM. EXPRESSION OF ISOFORM RTN1-C 

CC IS STRONGLY CORRELATED WITH NEURONAL DIFFERENTIATION. 

CC -!- PTM: ISOFORMS RTN1-A AND RTN1-B ARE PHOSPHOR YLATED . 

CC -!- SIMILARITY: Contains 1 reticulon domain. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib. ch) . 

CC 

DR EMBL; L10333; AAA59950.1; -. 

DR EMBL; L10334; AAA59951.1; 

DR EMBL; L10335; AAA59952.1; 

DR PIR; A46583; A46583 . 

DR PIR; 160904; 160904. 

DR Genew; HGNC: 104 67; RTN1. 

DR MIM; 600865; -. 

DR GO; GO:0030176; C : endoplasmic reticulum membrane, intrinsic p. . .; TAS. 

DR GO; GO: 0004871; F: signal transducer activity; NAS . 

DR GO; GO: 0030182; P:neuron differentiation; TAS. 

DR GO; GO: 0007165; P: signal transduction; NAS . 

DR InterPro; IPR003388; Reticulon. 

DR Pfam; PF02453; Reticulon; 1. 



DR 


PROSITE; 


PS50845; 


RETICULON; 


1. 


KW 


Endoplasmic reticulum; Alternative splicing; Transmembrane; 


KW 


Phosphorylation . 






FT 


TRANSMEM 


603 


623 


POTENTIAL . 


FT 


1RANSMEM 


726 


746 


POTENTIAL . 


FT 


DOMAIN 


589 


776 


RETICULON . 


FT 


DOMAIN 


609 


612 


POLY -LEU. 


FT 


VARSPLIC 


1 


420 


Missing (in isoform RTN1-B) . 


FT 








/FTId=VSP_005644 . 


FT 


VARSPLIC 


1 


568 


Missing (in isoform RTN1-C) . 


FT 








/FTId=VSP_005645 . 


FT 


VARSPLIC 


569 


588 


GPGPLGPGAPPPLLFLNKQK -> MQATAD 


FT 








KSQ (in isoform RTN1-C) . 


FT 








/FTId=VSP_005646. 


SQ 


SEQUENCE 


776 AA; 


83617 MW; 


CA5B6232353096FE CRC64 ; 



Query Match 13.3%; Score 787.5; DB 1; Length 776; 

Best Local Similarity 31.9%; Pred. No. 1.4e-23; 

Matches 229; Conservative 91; Mismatches 190; Indels 207; Gaps 25; 

Qy 588 ESEATPSPVLPDI - -VMEAPLNSA- - VPSAGASVIQPSSSPLE ASSVNY 632 

■ ■ I h h | : : ||: | : | 

Db 141 EELGTPGPSLPDVPGIESRGLFSSDSGI EMTPAESTEVNKILADPLDQMKAEAYKYIDIT 200 

Qy 633 - -ESIKHEPENPPPYEEA MS VS LKVS G I KE E I KE P EN I NAAL QET 675 

Db 2 01 RPEEVKHQEQHHPELEDKDLDFKNKDTDI S I KPEGVREPDK- PAP VEGKI I KDHLLEEST 259 

Qy 676 EAPYISIACDL 1 KETKLSAEPAPDFSDYSEMAKVEQPVPDHSELVED 722 

llll II : I I lh - ■-: h - 

Db 260 FAPYID DLSEEQRRAPQI TTPVKI TLTE I EPSVE TTTQEKTPEKQDI CLK 309 

Qy 723 SSPDSEPVDLFS DDS IPDVPQKQ DETVMLVKESLTETSF 761 

Mh I I III | : | || : :||: 
Db 310 PS PDTVPTVTVSEPEDDSPGS I TPPSSGTEPSAAESQGKGS I SEDELI TAI KEA 363 

Qy 762 ESMI EYENKEKLSALPPEGGKPYLESFKLSLDNTKDTLLPDEVSTLSKKEKI PLQMEELS 821 

= II I : :| Mill::: 

Db 364 - KGLS YETAENPRPVGQLADRP EVKARSGPPTI PSPLDHEA 403 

Qy 822 TAVYSNDDLFI SKEAQI RETETFSDSSPI EI IDEFP TLISSKTDSFS 868 

-M I II- M : I II 

Db 4 04 SSAESGD S E I ELVS EDPMAAEDALP SG Y VS FGH VGG P P P S P 444 

Qy 869 KLAREYTDLEVSHKSEIANAPDGAGSLPCTELPHDLSLKNIQPKVEEKISFSD 921 

: M : h : I : : I : | 

Db 445 ASPS I QYS I LREEREAELDSELI I ESCDASSAS EESPKREQDSPPMKPSALD 496 

Qy 922 DF SKNGSATSKVLL LP PDVSALGHTQAE I ES I VKP 956 

h I I I INN I h : | 

Db 497 AIREETGVRAEERAPSRRGLAEPGSFLDYPSTEPQPGPELPPGDGAL EPETPMLP 551 

Qy 957 KVLEKEAEKKLPSDTEKEDRSPSA 1 FSADLGKTS WDLLYWRD I K 1001 

: I : -I hi =| || =| Mill I II 

Db 552 RKPEEDSSSNQSPAATKGPGPLGPGAPPPLLF LNKQKAI DLLYWRDI K 599 

Qy 1002 KTGWFGASLFLLLSLTVFS I VSVTAYI ALALLS VTI SFRI YKGVI QAI QKSDEGHPFRA 1061 



Db 


600 


Hhllh 1 II Ml Ihlll | | : M 1 II llllllll hlhlhlllllhl 
QTGI VFGSFLLLLFSLTQFSWSWAYLALAALSATISFRI YKSVLQAVQKTDEGHPFKA 659 


Qy 


1062 


YLESEVAISEELVQKYSNSALGHVNCTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALF 1121 


Db 


660 


IN h -M :||h: :|| hlllllllll llllllllllllh lllllll 
YLELEITLSQEQIQKYTDCLQFYVNSTLKELRRLFLVQDLVDSLKFAVLMWLLTYVGALF 719 


Qy 


1122 


NGLTLLI LALI SLFSVPVI YERHQAQI DHYLGLANKNVKDAMAKI QAKI PGLKRKAE 1178 


Db 


720 


IMMh:h:hh:|hl i |||| :: UMIIIIII II II 
NGLTLLLMAWSMFTLPWYVKHQAQIDQYLGLVRTHINAWAKIQAKI PGAKRHAE 776 



RESULT 6 
RTN3_MOUSE 

ID RTN3_MOUSE STANDARD; PRT; 237 AA. 

AC Q9ES97; 

DT 16-OCT-2001 (Rel. 40, Created) 

DT 16-OCT-2001 (Rel. 40, Last sequence update) 

DT 28-FEB-2003 (Rel. 41, Last annotation update) 

DE Reticulon protein 3. 

GN RTN3. 

OS Mus musculus (Mouse) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi • 

OC Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Mus. 

OX NCBI_TaxID=10090; 

RN [1] 

RP SEQUENCE FROM N.A. 

RA Huang X., Zhou Y. , Qiang H . , Yuan J., Qiang B.; 

RT "Cloning and expression profile of a novel mouse cDNA encoding a human 

RT RTN3 homo log, " ; 

RL Submitted (OCT-1999) to the EMBL/ GenBank/DDBJ databases. 

CC -!- SUBCELLULAR LOCATION: Integral membrane protein. Endoplasmic 

CC reticulum (Potential) . 

CC -!- SIMILARITY: Contains 1 reticulon domain. 



This SWISS-PROT entry is copyright. It is produced through a collaboration 
between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 
the European Bioinf ormatics Institute. There are no restrictions on its 
use by non-profit institutions as long as its content is in no way 
modified and this statement is not removed. Usage by and for commercial 
entities requires a license agreement (See http://www.isb-sib.ch/announce/ 
or send an email to license@isb-sib. ch) . 



DR EMBL; AF195940; AAG31360.1; -. 

DR MGD; MGI : 1339970; Rtn3 . 

DR InterPro; IPR003388; Reticulon. 

DR Pfam; PF02453; Reticulon; 1. 

DR PROSITE; PS50845; RETICULON; 1. 

KW Transmembrane; Endoplasmic reticulum. 

FT TRANSMEM 69 8 9 POTENTIAL. 

FT TRANSMEM 167 187 POTENTIAL. 

FT DOMAIN 4 9 237 RETICULON. 

SQ SEQUENCE 237 AA; 25428 MW; EB60A0A7AC45F0DE CRC64; 



Query Match 10.6%; Score 626; DB 1; Length 237; 

Best Local Similarity 53.2%; Pred . No. 5.1e-18; 

Matches 125; Conservative 41; Mismatches 53; Indels 16; Gaps 4; 



CC 
CC 
CC 
CC 
CC 
CC 
CC 



Qy 960 EKEAEKKLPS -DTEKEDRSPSAI FSAD LGKTS WDLLYWRDI KKTG 1004 

I - M - llh M I I ||::|lhllll 

Db 3 ES SAATQS PS VS S S S SGAE P S ALGGGGGS PGACPALGAKS CGS S CAVHDL I FWRDVKKTG 62 

Qy 1005 WFGASLFLLLSLTVFS I VS VTAY I ALALLS VTI SFR I YKGVI QAI QKSDEGHPFRAYLE 1064 

Ml = 1 : I I I I lh:|| :h I I I I I I I I I I I : I I Illhllhllllhlih 
Db 63 FVFGTTLIMLLSLAAFSVISWSYLILALLSVTISFRVYKSVIQAVQKSEEGHPFKAYLD 122 

Qy 1065 SEVAI SEELVQKYSNSALGHVNCTI KELRRLFLVTDDL VDSLKFAVLMWVFTYVGALFNGL 1124 

: : : I I I hh III H : llllhlllllll | | Ih llllhllh 

Db 123 VD I TLS S EAFHN YMNAAM VH VNKALKL 1 1 RLFLVEDLVDSLKLAVFMWLMTY VGA VFNG I 182 

Qy 1125 TLLI LALI SLFS VPVI YERHQAQ I DHYLGLANKNVKDAMAKI QAKI PGL - KRKAE 1178 

MUM : :||||::|h:: MMhhl | : ||||hlh hill 

Db 183 TLL I LAELLVFS VP I VYEKYKTQ I DHYVG I ARDQTKS I VEKI QAKLPGI AKKKAE 237 

RESULT 7 
RTN3_HUMAN 

ID RTN3_HUMAN STANDARD; PRT; 236 AA. 

AC 095197; 

DT 16-OCT-2001 (Rel . 40, Created) 

DT 16-OCT-2001 (Rel. 40, Last sequence update) 

DT 15-SEP-2003 (Rel. 42, Last annotation update) 

DE Reticulon protein 3 (Neuroendocrine-specif ic protein-like 2) {NSP-like 

DE protein II) (NSPLII) . 

GN RTN3 OR NSPL2 . 

OS Homo sapiens (Human) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

OX NCBI_TaxID=9606; 

RN [1] 

RP SEQUENCE FROM N.A., AND TISSUE SPECIFICITY. 

RC TISSUE=Retina; 

RX MEDLINE=99265974; PubMed=1033 1947 ; 

RA Moreira E.F., Jaworski C.J., Rodriguez I.R.; 

RT "Cloning of a novel member of the reticulon gene family (RTN3) : gene 

RT structure and chromosomal localization to llql3." ; 

RL Genomics 58:73-81(1999). 

RN [2] 

RP SEQUENCE FROM N.A. 

RA Huang X., Zhou Y. , Du G. , Yuan J., Qiang B. ; 

RT "Cloning and expression analysis of a cDNA encoding a novel 

RT neuroendocrine-specif ic protein-like protein 1: NSPL1 . " ; 

RL Submitted (JAN-1999) to the EMBL/GenBank/DDBJ databases. 

RN [3] 

RP SEQUENCE FROM N.A. 

RC TISSUE=Brain, Eye, and Lymph; 

RX MEDLINE=22388257; PubMed=124 7 7 932 ; 

RA Strausberg R.L., Feingold E.A., Grouse L.H. , Derge J.G., 

RA Klausner R.D., Collins F.S., Wagner L., Shenmen CM., Schuler G.D., 

RA Altschul S.F., Zeeberg B., Buetow K.H., Schaefer C.F., Bhat N.K., 

RA Hopkins R.F., Jordan H. , Moore T. , Max S.I., Wang J., Hsieh F . , 

RA Diatchenko L. , Marusina K. , Farmer A. A. , Rubin G.M., Hong L. , 

RA Stapleton M., Scares M.B., Bonaldo M.F., Casavant T.L., Scheetz T.E., 

RA Brownstein M.J., Usdin T.B., Toshiyuki S., Carninci P., Prange C, 



RA Raha S.S., Loquellano N.A., Peters G.J., Abramson R.D., Mullahy S.J., 

RA Bosak S.A., McEwan P.J., McKernan K.J., Malek J . A. , Gunaratne P.H., 

RA Richards S., Worley K.C., Hale S. # Garcia A.M., Gay L.J., Hulyk S.W., 

RA Villalon D.K., Muzny D.M. , Sodergren E.J. # Lu X., Gibbs R.A., 

RA Fahey J . , Helton E . , Ketteman M . , Madan A., Rodrigues S., Sanchez A. # 

RA Whiting M . , Madan A., Young A. C. , Shevchenko Y . , Bouf f ard G.G. , 

RA Blakesley R.W., Touchman j.w. , Green E.D., Dickson M.C., 

RA Rodriguez A.C., Grimwood J. , Schmutz J. , Myers R.M. , 

RA Butterfield Y.S.N. , Krzywinski M.I., Skalska U. , Smailus D.E., 

RA Schnerch A., Schein J.E., Jones S.J.M., Marra M.A.; 

RT "Generation and initial analysis of more than 15,000 full-length 

RT human and mouse cDNA sequences."; 

RL Proc. Natl. Acad. Sci . U.S.A. 99:16899-16903(2002). 

CC -!- SUBCELLULAR LOCATION: Integral membrane protein. Endoplasmic 

CC reticulum (Potential) . 

CC -!- TISSUE SPECIFICITY: WIDELY EXPRESSED WITH HIGHEST EXPRESSION IN 
CC BRAIN. THREE TIMES MORE ABUNDANT IN MACULA THAN IN PERIPHERAL 

CC RETINA. 

CC -!- SIMILARITY: Contains 1 reticulon domain. 

CC 

CC This' SWISS- PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to 1 icense@isb-sib . ch) . 

CC 

DR EMBL; AF059524; AAC99319.1; -. 

DR EMBL; AF059529; AAD20951.1; -. 

DR EMBL; AF059525; AAD20951.1; JOINED. 

DR EMBL; AF059526; AAD20951.1; JOINED. 

DR EMBL; AF059527; AAD20951.1; JOINED. 

DR EMBL; AF059528; AAD20951 . 1 ; JOINED. 

DR EMBL; AF119297; AAD26810.1; -. 

DR EMBL; BC000634; AAH00634.1; -. 

DR EMBL; BC010556; AAH10556.1; -. 

DR EMBL; BC011394; AAH11394.1; -. 

DR EMBL; BC022993; AAH22 993.1; 

DR Genew; HGNC: 104 69; RTN3 . 

DR MIM; 604249; -. 

DR GO; GO: 0005615; C : extracel lular space; TAS. 

DR InterPro; IPR003388; Reticulon. 

DR Pfam; PF02453; Reticulon; 1. 

DR PROSITE; PS50845; RETICULON; 1. 

KW Transmembrane; Endoplasmic reticulum. 

FT TRANSMEM 68 88 POTENTIAL. 

FT TRANSMEM 177 197 POTENTIAL. 

FT DOMAIN 4 8 236 RETICULON. 

SQ SEQUENCE 236 AA; 25609 MW; DDC6A4 5 4 4ABCDFB7 CRC64 ; 

Query Match 10.6%; Score 625.5; DB 1; Length 236; 

Best Local Similarity 54.8%; Pred. No. 5.4e-18; 

Matches 119; Conservative 41; Mismatches 56; Indels 1; Gaps 1; 

QY 963 AEKKLPSDTEKEDRSPSAIFSADLGKTSWDLLYWRDIKKTGWFGASLFLLLSLTVFSI 1022 

HI 1= H IhUlhlMI III I :|||| lh 



Db 


20 


AEPSAPGGGGSPGACPALGTKSCSSSCAVHDLIFWRDVKKTGFVFGTTLIMLLSLAAFSV 


79 


Qy 


1023 


VSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRAYLESEVAISEELVQKYSNSAL 


1082 


Db 


80 


HI :h MINIMUM 1 M 1 M 1 1 M 1 I I N I I M :: :|| 1 hh 
I SWS YLI LALLSVTI SFRI YKSVI QAVQKSEEGHPFKAYLDVDITLSSEAFHNYMNAAM 


139 


Qy 


1083 


GHVNCTI KELRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGLTLLI LALI SLFSVPVI YE 


1142 


Db 


140 


hi 1 : IMIMM M II Ih llllhllhllllll : - ! IMM 

VH I NRALKL 1 1 RLFL VEDLVDS LKLAVFMWLMT Y VGAVFNG I TLL I LAELL I FS VP I VYE 


199 


Qy 


1143 


RHQAQI DHYLGLANKNVKDAMAKI QAKI PGL-KRKAE 1178 

= = = MMhhl 1 : llllhlh hill 
KYKTQI DHYVGI ARDQTKS I VEKI QAKLPGI AKKKAE 23 6 




Db 


200 





RESULT 8 
RTN2 HUMAN 



ID RTN 2 __HUMAN STANDARD; PRT; 54 5 AA . 

AC 075298; 060509; 

DT 16-OCT-2001 (Rel . 40, Created) 

DT 16-OCT-2001 (Rel. 40, Last sequence update) 

DT 15-SEP-2003 (Rel. 42, Last annotation update) 

DE Reticulon protein 2 (Neuroendocrine-specif ic protein-like 1) (NSP-like 

DE protein 1) (NSPLI) . 

GN RTN2 OR NSPLI . 

OS Homo sapiens (Human) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

OX NCB I_Tax I D= 9 6 0 6 ; 

RN [1] 

RP SEQUENCE FROM N.A., ALTERNATIVE SPLICING, AND TISSUE SPECIFICITY. 

RC TISSUE=Lung carcinoma; 

RX MEDLINE=98360096 ; PubMed-9693 037 ; 

RA Roebroek A.J.M., Contreras B. , Pauli I.G.L., Van de Ven W.J.M.; 

RT "cDNA cloning, genomic organization, and expression of the human RTN2 

RT gene, a member of a gene family encoding reticulons . " ; 

RL Genomics 51:98-106(1998). 

RN [2] 

RP SEQUENCE OF 108-545 FROM N.A. (ISOFORM RTN2 -B) . 

RC TISSUE=Brain; 

RX MEDLINE=98191726; PubMed=953 0622 ; 

RA Geisler J.G. , Stubbs L.J., Wasserman W.W. , Mucenski M . L . ; 

RT "Molecular cloning of a novel mouse gene with predominant muscle and 

RT neural expression ; 

RL Mamm. Genome 9:274-282(1998). 

CC -!- SUBCELLULAR LOCATION: Integral membrane protein. Endoplasmic 
CC reticulum (Potential) . 

CC -!- ALTERNATIVE PRODUCTS: 

CC Event ^Alternative splicing; Named isoforms=2; 

CC Name=RTN2-A; 

CC Isold=075298-1; Sequence=Displayed; 

CC Note=Isoform RTN2-C is produced by alternative initiation at 

CC Met -341 of isoform RTN2-A; 

CC Name=RTN2-B; 

CC IsoId=075298-2; Sequence=VSP_00564 9 ; 

CC Event ^Alternative initiation; 

CC Comment =2 isoforms, RTN2-A (shown here) and RTN2-C, are produced 



CC by alternative initiation at Met-1 and Met-341; 

CC -!- TISSUE SPECIFICITY: ISOFORM RTN2-C IS HIGHLY EXPRESSED IN SKELETAL 
CC MUSCLE . 

CC -!- SIMILARITY: Contains 1 reticulon domain. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib . ch) . 

CC 

DR EMBL; AF004222; AAC32542.1, 

DR EMBL; AF004223; AAC32543.1; 

DR EMBL; AF004224; AAC32544.1; 

DR EMBL; AF038540; AAC14910.1; 

DR Genew; HGNC: 10468; RTN2 . 

DR MIM; 603183; -. 

DR GO; GO: 003 0176; C : endoplasmic reticulum membrane, intrinsic p. 

DR GO; GO: 0004871; F: signal transducer activity; NAS . 

DR GO; GO: 0007165; P: signal transduction; NAS. 

DR InterPro; IPR003388; Reticulon. 

DR Pfam; PF02453; Reticulon; 1. 

DR PROSITE; PS50845; RETICULON; 1. 

KW Endoplasmic reticulum; Alternative splicing; Transmembrane; 



NAS. 



KW 


Alternative 


initiation . 




FT 


CHAIN 


1 


545 


RETICULON PROTEIN 2, ISOFORM RTN2 


FT 


CHAIN 


341 


545 


RETICULON PROTEIN 2, ISOFORM RTN2 


FT 


INIT_MET 


341 


341 


FOR ISOFORM RTN2-C. 


FT 


TRANSMEM 


368 


388 


POTENTIAL. 


FT 


TRANSMEM 


463 


483 


POTENTIAL . 


FT 


DOMAIN 


345 


545 


RETICULON. 


FT 


VARSPLIC 


272 


344 


Missing (in isoform RTN2-B) . 


FT 








/FTId=VSP_005649 . 


SQ 


SEQUENCE 


545 AA; 


59263 MW; 


971FD2F909E1E9E6 CRC64; 


Query Match 




8.7%; 


Score 514; DB 1; Length 54 5; 



Best Local Similarity 28.2%; Pred. No. 2.9e-13; 

Matches 169; Conservative 81; Mismatches 186; Indels 164; Gaps 18, 

Qy 676 EAP Y I S I ACDL I KETKLSAEPAPDFSDYS EMAKVEQPVPDHS ELVEDSS PD - S EPVDL - F 733 

III I II I I I : I : : : I I :| I I :| I 

Db 13 EAP STASSTPDSTEGGNDDSDFRELHTAREFSEEDEE - - ETTSQDWGTPRELTF 64 

Qy 734 S DDSI PDVPQKQDETVMLVKESLTETSFESMI EYENKEKLSALPPEG 780 

I II h I U - : I =1 I I 

Db 65 SYIAFDGWGSGGRRDSTARRPRPQGRSVSEPRDQHPQPSLGDSLESI PSLSQSPEPGRR 124 

Qy 781 GKPYLESFKLSLDNTKDTLLPDEVSTLSKKEKI PLQMEELSTAVYSNDDLFISKEAQIRE 840 

II II I I I : I-: I :::: | 

Db 125 GDP DTAPPSE RPLEDLRLRLDHLG WVARGTGSGE 158 

Qy 841 TETFSDSSPIEI IDEFPTLISSKTDSFSKLAREYTDLEVSHKSEIANAPDGAGSLPCTEL 900 

: I h I : I II I U II I 
Db 159 DSSTSSSTPLE- -DEEP QEPNRLETGEAGE EL 188 



J 



Qy 


901 


PHDLSLKNIQPKVEEKI S F S DD F S KNG SATS KVLLL P P D VS ALGHT 


946 


Db 


189 


--DLRLRLiAQPSSPEVLTPQLSPGSGTPQAGTPSPSRSRDSNSGPEEPLLEEEEKQWGPL 


246 


Qy 


947 


QAE 1 ESI VKPKVLEKEAE KKLP 


968 


Db 


247 


1 =1 l = |::| 1 I :| 
EREPVRGQCLDSTDQLEFTVEPRLLGTAMEWLKTSLLLAVYKTVPILELSPPLWTAIGWV 


306 


Qy 


969 


SDTEKEDRSPSAI FSADLGKTS WDLLYWRDI KKTGWFGASL 


1011 


Db 


307 


: : : | | | | : | : | | | | | | : | : : | | | | 
QRGPTPPTPVLRVLLKWAKSPRSSGVPSLSLGADMG-SKVADLLYWKDTRTSGWFTGLM 


365 


Qy 


1012 


FLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRAYLESEVAISE 


1071 


Db 


366 


Ml MlIM l-ll II Ml hh | = |l= = 1 : | : ! : | | | : - : : : 

VSLLCLLHFS I VSVAAHLALLLLCGTI SLRVYRKVLQAVHRGDGANPFQAYLDVDLTLTR 


425 


Qy 


1072 


ELVQKYSNSALGHVNCTI KELRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGLTLLI LAL 


1131 


Db 


426 


1 == 1= 1 : 1 1 IMMIMI hi :: MINIMUM : 
EQTERLSHQI TS R WS AATQLRHF FLVEDLVDSLKLALLF Y I LTF VGA I FNGLTLL I LG V 


485 


Qy 


1132 


I SLFS VP VI YERHQAQ I DHYLGLANKNVKDAMAKI QAKI PGL - - - KRKAE 


1178 


Db 


486 


1 lh:h:| =111111 hll : , | ||| 
I GLFT I PLLYRQHQAQ I DQYVGLVTNQLSHI KAKI RAKI PGTGALASAAAAVSGS KAKAE 


545 



RESULT 9 
RTN2_MOUSE 

ID RTN2JVIOUSE STANDARD; PRT; 4 71 AA. 

AC 070622; 070620; 

DT 16-OCT-2001 (Rel . 40, Created) 

DT 16-OCT-2001 (Rel. 40, Last sequence update) 

DT 28-FEB-2003 (Rel. 41, Last annotation update) 

DE Reticulon protein 2 (Neuroendocrine-specific protein-like 1) (NSP-like 
DE protein 1) (NSPLI) . 



GN RTN2 OR NSPLI . 

OS Mus musculus (Mouse) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Mus. 

OX NCBI_TaxID=10090; 

RN [1] 

RP SEQUENCE FROM N.A. (ISOFORMS 1 AND 2), AND TISSUE SPECIFICITY. 

RC STRAIN=FVB/N, and 129/Sv; TISSUE=Cerebellum, and Skeletal muscle; 

RX MEDLINE=98191726; PubMed=953 0622 ; 

RA Geisler J.G., Stubbs L.J., Wasserman W.W. , Mucenski M.L.; 

RT "Molecular cloning of a novel mouse gene with predominant muscle and 

RT neural expression. " ; 

RL Mamm. Genome 9:274-282(1998). 

CC -!- SUBCELLULAR LOCATION: Membrane-bound . Endoplasmic reticulum 

CC (Potential) . 

CC ALTERNATIVE PRODUCTS: 

CC Event =Alternative splicing; Named isoforms=2; 

CC Name= 1 ; Synonyms =Bra in ; 

CC Isold=070622-1; Sequence=Displayed; 

CC Name=2; Synonyms^Muscle ; 

CC IsoId=070622-2; Sequence=VSP_005650 , VSP_005651; 

CC -!- TISSUE SPECIFICITY: EXPRESSED PREDOMINANTLY IN NEURAL AND MUSCULAR 
CC TISSUES. 



CC -!- SIMILARITY: Contains 1 reticulon domain. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to 1 icense@isb-sib . ch) . 

CC 

DR EMBL; AF038537; AAC14906.1; -. 

DR EMBL; AF038537; AAC14907.1; -. 

DR EMBL; AF038538; AAC14908.1; -. 

DR EMBL; AF038539; AAC14909.1; 

DR EMBL; AF093624; AAD13195.1; 

DR MGD; MGI: 107612; Rtn2 . 

DR InterPro; IPR003388; Reticulon. 

DR Pfam; PF02453; Reticulon; 1. 

DR PROSITE; PS50845; RETICULON; 1. 

KW Endoplasmic reticulum; Alternative splicing; Transmembrane. 

FT TRANSMEM 2 95 315 POTENTIAL. 

FT DOMAIN 272 4 71 RETICULON. 

FT VARSPLIC 1 267 Missing (in isoform 2). 

FT /FTId=VSP_005650. 

FT VARSPLIC 268 271 PLLL -> MGSK (in isoform 2) . 

FT /FTId=VSP_005651 . 

SQ SEQUENCE 471 AA; 51346 MW; 9BBD8F372CF63AD3 CRC64; 



Query Match 8.4%; Score 500; DB 1; Length 471; 

Best Local Similarity 29.3%; Pred. No. 8.3e-13; 

Matches 159; Conservative 83; Mismatches 177; Indels 124; Gaps 16; 

Qy 676 EAPYISIACDLIKETKLSAEPAPDFSDYSEMAKVEQPVPDHSELVEDSSPD-SEPVDL-F 733 

Ml III I I h I - : I I I :| | | :| | 

Db 13 EAP STASSTPDSTEGGNDDSDFRELHTAREFSEDEEE- -ETTSQDWGTPRELTF 64 

QY 734 S DDSIPDVPQKQDETVMLVKESLTETSFESMIEYENKEKLSALPPEG 78 0 

I lh h I :| - :| II 

Db 65 SYIAFDGWGSGGRRDSWRRPRPQGRSVSEPRDPPQQSGLGDSLESI PSLSQSPEPGRR 124 

QY 781 GKPYLESFKLSLDNTKDTLLPDEVSTLSKKEKI PLQMEELSTAVYSNDDLFI SKEAQI RE 840 

II I : I I I- |::::| : : : I I 

Db 125 GDP DPVPPAE RPLEELRLRLDQLG WWRSAGSGE 158 

Qy 841 TETFSDSSPIEIIDEFPTLISS KTDSFSKLARE-YTDLEV SHKSEIAN 887 

I hhl :| | : = :|: :||: : ||| | : |: 

Db 159 DSATS SSTPLE - - NEEPDGLEASEAGEETNLELRLAQSLHLQLEVLTPQLS PSSGTPQAH 216 

QY 888 APDGAGSLPCTELPHDLSL QPKVEEKISFSDDFSKNGSATSKVLLLPPDVSALGHTQ 94 7 

II II I l : III ::| | : 

Db 217 TPS PQRSQDSNSGPDDEPLLNV VEEH WRLLEQEPITAQCLDST 259 

Qy 948 AEI ESI VKPKVLEKEAEKKLPSDTEKEDRSPSAI FSADLGKTSWDLLYWRDI KKTGWF 1007 

Db 2 60 DQSEFMLEPLLL VADLLYWKDTRTSGAVF 288 



Qy 



10 08 GASLFLLLSLTVFS I VSVTAYI ALALLS VTI SFRI YKGVI QAI QKSDEGHPFRAYLESEV 1067 



: II I I I I I I I |::|| I III hh hlh : | :||:|||: - 
Db 28 9 TGLMASLLCLLHFS I VS VAAHLALLGLCATI SLRWRKVLQAVHRGDGTNPFQAYLDMDL 34 8 

Qy 1068 A I SEELVQKYSNSALGHVNCTI KELRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGLTLL 1127 

= = I == I Ml : I I Ihllllll hi :: NIMIIIIh 

Db 34 9 TLTREQTERLSQQIASHWSTATQLRHFFLVEDLVDSLKLALLFYILTFVGAIFNGLTLV 4 08 

Qy 1128 I LALI SLFSVPVI YERHQAQI DHYLGLANKNVKDAMAKI QAKI PGL KR 1175 

II :::|hlh:| Hi! hM : MM ' 

Db 4 09 I LGWALFTVPLLYRQHQAQIDQYVGLVTNQLSHI KAKIRAKI PGTGTLAPTASVSGSKA 468 

Qy 1176 KAE 1178 

III 

Db 469 KAE 471 



RESULT 10 
CPN DROME 



ID CPN_DROME STANDARD; PRT; 865 AA. 

AC Q02910; 

DT 01-OCT-1993 (Rel . 27, Created) 

DT 01-OCT-1993 (Rel. 27, Last sequence update) 

DT 15-SEP-2003 (Rel. 42 , Last annotation update) 

DE Calphotin. 

GN CPN OR CAP. 

OS Drosophila melanogaster (Fruit fly) . 

OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; 

OC Neoptera; Endopterygota ; Diptera; Brachycera; Muscomorpha; 

OC Ephydroidea; Drosophilidae ; Drosophila. 

OX NCBI_TaxID=7227 ; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN-Canton-S; 

RX MEDLINE=93165729; PubMed=8094559 ; 

RA Martin J.H., Benzer S., Rudnicka M. , Miller C.A. ; 

RT "Calphotin: a Drosophila photoreceptor cell calcium -binding protein."; 

RL Proc. Natl. Acad. Sci. U.S.A. 90:1531-1535(1993). 

RN [2] 

RP SEQUENCE FROM N.A. 

RC STRAIN=Canton-S ; 

RX MEDLINE=93165730 ; PubMed=8434 015 ; 

RA Ballinger D.G., Xue N. , Harshman K.D. ; 

RT "A Drosophila photoreceptor cell -specif ic protein, calphotin, binds 

RT calcium and contains a leucine zipper." ; 

RL Proc. Natl. Acad. Sci. U.S.A. 90:1536-1540(1993). 

CC -!- FUNCTION: Might function as a calcium-sequestering "sponge" to 
CC regulate the amount of free cytoplasmic calcium. It binds 0.3 mole 

CC of Ca(2+) per mole of protein. 

CC -!- SUBUNIT: Homodimer (Probable). 

CC -!- SUBCELLULAR LOCATION: CYTOPLASMIC; HYPODENSE COMPARTMENT. 

CC -!- TISSUE SPECIFICITY: SOMA AND AXONS OF PHOTORECEPTOR CELLS OF 

CC COMPOUND EYES AND OCELLI . 

CC -!- DEVELOPMENTAL STAGE: EXPRESSED EARLY IN PHOTORECEPTOR CELL 
CC DEVELOPMENT . 

cc 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 



cc 


the European Bioinf ormatics Institute. There are no restrictions on its 


cc 


use by 


non-profit institutions as long as its content is in no way 


cc 


modified 


and this 


statement is not removed. Usage by and for commercial 


LL 


entities 


requires 


a license agreement (See http://www.isb-sib.ch/announce/ 


CC 


or send an email to 1 icense@isb-sib . ch) . 


cc 








DR 


EMBL; L02111; AAA28405.1; -. 


DR 


EMBL; L05080; AAA2 


8420.1; -. 


DR 


PIR; A472 


82; A47282. 


UK 


PIR; A47283; A4728 


3 . 


UK 


FlyBase; 


FBgn0010218; Cpn . 


UK 


GO; GO: 0005509; F: 


calcium ion binding activity; IDA. 


T/TiT 


Calcium-binding . 




Dm 

r 1 


CONFLICT 


36 


36 A -> AVAPAWA (IN REF . 2) . 


r 1 


CONFLICT 


43 


43 I -> T (IN REF . 2) . 




CONFLICT 


64 


64 I -> V (IN REF. 2) . 


t 1 


CONFLICT 


76 


76 T -> A (IN REF . 2) . 


r 1 


CONFLICT 


100 


100 P -> PP (IN REF. 2) . 


FT 


CONFLICT 


126 


127 VQ -> AP (IN REF. 2) . 


FT 


CONFLICT 


154 


154 I - > V (IN REF. 2) . 


FT 


CONFLICT 


160 


160 S -> T (IN REF. 2) . 


FT 


CONFLICT 


534 


534 A -> E (IN REF. 2) . 


FT 


CONFLICT 


699 


699 I -> T (IN REF. 2) . 


FT 


CONFLICT 


703 


703 V -> L (IN REF. 2) . 


FT 


CONFLICT 


721 


721 D -> E (IN REF. 2) . 


SQ 


SEQUENCE 


865 AA; 


84781 MW; 2110417EOBOE7CFE CRC64; 


Query Match 




5.5%; Score 324.5; DB 1; Length 865; 



Best Local Similarity 22.3%; Pred . No. 9.6e-06; 
Matches 229; Conservative 128; Mismatches 375; Indels 297; Gaps 49; 

Qy 59 PAAGLSAAPVPTAPAAGAPLMDFGNDFVPPAPRGPLPAAPPVAPERQPSWDPSPVSSTVP 118 

I : Mil I I Ih Mllh I I I hi hi 

Db 9 PVSAPVAAPV-TPSAVAAPVQVVSPAAVAPAPAAPIAVTPVAPPPTLASVQPATV- - TIP 65 

QY H9 APSPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPVWTPPAP APAAPP--ST 172 

lhh:|hhl I I I.I I | || ||| :| 

Db 66 APAPIAAASVTP— -VASVAPPWAAPTPPAA-SPVSTPVAVAQI PVAVSAPVAPPVAAT 121 

Qy 173 PAAPKRRGSSGAWXXXXKIMDLKEQPGNTISAGQEDFPSVLLETAASXPSLSPLSAASF 232 

I : ' 1= I II h || | :||: |: 

Db 122 PTPWQIPVAAPVIAT PPVAASA PT PAAVTPVISPVIAS- - 160 

Qy 233 KEHEYLGNLSTVLPTEGTLQENVSEASKEVSEKAKTLLIDRDLTEFSELEYSEMGSSFSV 292 

hi I II- I - | :| 
Db 161 PPWPANTT VPVAAPVAAVPAAVPWAPVLAP AV 194 

Qy 293 SPKAESAVIVAN PREEI I VKNKDEEEKLVSNNILHXQQELPTALTKLVKEDEWS 347 

-I I HI I II I :| : h I Ih 

Db 195 APAV- -APWAETPAPPPVAEI PVAT 1 PECVAPLI PEVSWA 234 

Qy 34 8 SEKAKDSFNEKRVAVEAPMREEYADFKPFERVWEVKDSKEDSDMLAAGGKIESNLESKVD 4 07 

: I =1111 I | :: :|: : : | 

Db 235 T KPLAAAEPVWAPPATET PWAPAAASPHVSVAPAVETAWAPVS 280 



Qy 



408 KKCFADSLEQTNHEKDSESSNDDTSFPSTPEGIKDRSGAYITCAPFNPAATESIATNIFP 467 



Db 


281 


ASTEPPV AAATLTTAPETPAL 


301 


Qy 


468 


LLEDPTSENXTDEKKIEEKKAQI VTEKNTSTKTSNPFFVAAQDSETDYVTTDNLTKVTEE 

1 :| 1 Ml l| 


527 


Db 


302 


111 III || 

APWAESQ VAA NTWATPP 


320 


Qy 


528 


WANM P EGLT PDL VQEACES ELNE VTGTKI AYETKMDLVQTS E VMQES L Y PA 
| | | : | : | : | | | : | | | : | | | | ; | | 


579 


Db 


321 


TPAPEPETIAPPW AETPEVASVAVA-ETTPPW- -PPVAAESI - PAP WATT PV 


371 


Qy 


580 


-AQLCPSFEESEATPSPVLPDI VMEAPLNSAVPSAGASVI QPSSS PLEASSVNYES I KHE 


638 


Db 


372 


1 1 = : 1= 1 1 1 := = 1= 1 1 1 = 1 |= 1 1 
PATLAVTDPDVTASAVPELPPVIAPSPVPSAVAETPVDLAPPVLPPVAAEPVPAWAEET 


431 


Qy 


S639 


PENPPPYEEAMSV- SLKVSGI KEEI KEPENINAALQETEAPYI SI ACDLI KET PA 


697 


Db 


432 


PETPAPASAPVTIAALDIPEVAPVIAAPSDAPAEAPSAAAPI VSTPPTTASVPETTAPPA 


491 


Qy 


698 


p DFSDYSEMAKVEQPVPDHSEL VEDSSPDSEPVDLFSDDSI P- -DVPQK 


744 


Db 


492 


|| M 1 : 1 1 1 h | | : | | | : | : | 
AVPTEPIDVSVLSE-AAIETPVAPPVEVTTEVAVADVAPPEAAADLIIEPVEPPAPIPDL 


550 


Qy 


745 


QDETVMLVKESLTETSFESMIEYENKEKLSALPPEGGKPYLESFKLSLDNTKDTLLPDEV 


804 


Db 


551 


= =l = 1- = 1 -III - : = | 
LEQTTSVPAVEAAESTSSPIPE TSLPPPNEAVASPEVAVAPITAPEPI PEPEP 


603 


Qy 


805 


STLSKKEKI PLQ MEELSTAVYSNDDLFISKEAQIRETETFSDSSPIEI IDEFPTLI 

1 : 1 II- -III 1 : || I | | | IN : 


860 


Db 


604 


SLATPTEP I PVEAPWI QEAVDAV EVPVTETST SIP- ETTVEFPEAV 


649 


Qy 


861 


SSKTDSFSKLA -SHKSEIANAPDGAGSLPCTELP tt — HD^ 906 


Db 


650 


■ 1 1 1 * 1 " * • • 1 1 : | : : : 

AEKV LDPAITEAPVTTQEPDVANINDGA PAT E I TT PAVE I VTAAAE VS D I A I 


701 


Qy 


907 


KNIQPKVEEKISFSDDFSKNGSATSKVLL LP PDVSALG 


944 


Db 


702 


1 1 1 -h = : = : -I- :| |:|| 

PVI DPP VPQE I AVA - E I P ETDTKPAEVI VEQST I P I EAPVPEVS KYAE PVI S EAPAAE VP 


760 


Qy 


945 


HTQAEIESI VKPKVLEKEAEKKLPSDTEKEDRSPSAIFSADLGKTSVVDLLY 


996 


Db 


761 


: 1 1 -'11:11 h h - Ml 1 = 1 = || 
ITAGDNPDNTSVGISEW-PTIAEKPVEEVPTSEIPEQSSSPSD- -SVPVAK- - ITPLL- 


814 


Qy 


997 


WRDIKKTGV 1005 




Db 


815 


II- 1 1 
-RDLQTTDV 822 





RESULT 11 
PCLO_HUMAN 

ID PCLO_HUMAN STANDARD; PRT; 514 7 AA . 

AC Q9Y6V0; 043373; 060305; Q9BVC8 ; Q9UIV2; Q9Y6U9 ; 
DT 28-FEB-2003 (Rel. 41, Created) 
DT 28-FEB-2003 (Rel. 41, Last sequence update) 
DT 15-SEP-2003 (Rel. 42, Last annotation update) 
DE Piccolo protein (Aczonin) (Fragments) . 
GN PCLO OR ACZ OR KIAA0559. 
OS Homo sapiens (Human) . 



OC Eukaryota; Metazoa; Chorda ta; Crania ta; Vertebra ta; Euteleostomi ; 

OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

OX NCBI_TaxID=9606 ; 
RN [1] 

RP SEQUENCE OF 1-759 FROM N . A . 

RC TISSUE=Brain ; 

RX MEDLINE-99439764 ; PubMed=10508862 ; 

RA Wang X., Kibschull M. , Laue M.M., Lichte B., Petrasch-Parwez E. , 

RA Kilimann M.W. ; 

RT "Aczonin, a 550-kd putative scaffolding protein of presynaptic active 

RT zones, shares homology regions with rim and bassoon and binds 

RT prof il in. " ; 

RL J. Cell Biol. 147:151-162(1999). 
RN [2] 

RP SEQUENCE OF 552-4404 FROM N.A. 

RA Kraemer J., Wollam C. , Wohldmann P., McGrane B.; 

RL Submitted (DEC-1999) to the EMBL/ GenBank / DDB J databases. 

RN [3] 

RP SEQUENCE OF 3619-5147 FROM N.A. (ISOFORM 2) . 

RC TISSUE=Brain ; 

RX MEDLINE=98290545; PubMed=962 8 58 1 ; 

RA Nagase T. , Ishikawa K.-I., Miya j im 

RA Nomura N. , Ohara O. ; 

RT "Prediction of the coding sequences of unidentified human genes. IX. 

RT The complete sequences of 100 new cDNA clones from brain which can 

RT code for large proteins in vitro."; 

RL DNA Res. 5:31-39(1998). 
RN [4] 

RP SEQUENCE OF 4405-4439 FROM N.A. 

RC TISSUE=Placenta; 75X1 a A., Kotani H. , TP KD 442850XS 

RX MEDLINE=22388257; PubMed=124 77 932 ; 

RA Strausberg R.L., Feingold E.A., Grouse L.H. , Derge J.G., 

RA Klausner R.D., Collins F.S., Wagner L., Shenmen CM., Schuler G.D., 

RA Altschul S.F., Zeeberg B., Buetow K.H., Schaefer C.F., Bhat N.K., 

RA Hopkins R.F., Jordan H., Moore T. , Max S.I., Wang J. , Hsieh F . , 

RA Diatchenko L., Marusina K. , Farmer A. A. , Rubin G.M., Hong L. , 

RA Stapleton M. , Soares M.B., Bonaldo M.F., Casavant T.L., Scheetz T.E., 

RA Brownstein M.J., Usdin T.B., Toshiyuki S., Carninci P., Prange C. , 

RA Raha S.S., Loquellano N.A. , Peters G.J., AbY . , Mullahy StSB?:P2113 

RA Bosak S.A., McEwan P.J., McKernan K.J., Malek J. A. , Gunaratne P.H., 

RA Richards S., Worley K.C., Hale S., Garcia A.M., Gay L.J., Hulyk S.W., 

RA Villalon D.K., Muzny D.M., Sodergren E.J., Lu X., Gibbs R.A., 

RA Fahey J . , Helton E . , Ketteman M. , Madan A., Rodrigues S., Sanchez A., 

RA Whiting M., Madan A., Young A.C., Shevchenko Y. , Bouffard G.G. , 

RA Blakesley R.W. , Touchman J.W., Green E.D., Dickson M.C., 

RA Rodriguez A.C., Grimwood J., Schmutz J. , Myers R.M., 

RA Butterfield Y.S.N. , Krzywinski M.I., Skalska U. , Smailus D.E., 

RA Schnerch A. , Schein J.E., Jones S.J.M., Marra M.A. ; 

RT "Generation and initial analysis of more than 15,000 full-length 

RT human and mouse cDNA sequences."; 

RL Proc. Natl. Acad. Sci. U.S.A. 99:16899-16903(2002). 

RN [5] 

RP SEQUENCE OF 4405-5147 FROM N.A. 

RA Kalicki J., Elliott G.; 

RL Submitted (FEB-1998) to the EMBL/GenBank/DDBJ databases. 

CC -!- FUNCTION: May act as a scaffolding protein involved in the 

CC organization of synaptic active zones and in synaptic vesicle 



CC trafficking (By similarity) . 

CC -!- SUBUNIT: Interacts with Rabacl/Pral and profilin (By similarity). 

CC -!- SUBCELLULAR LOCATION: Concentrated at the presynaptic side of 

CC synaptic junctions (By similarity) - 

CC -!- ALTERNATIVE PRODUCTS: 

CC Event ^Alternative splicing; Named isoforms=2; 

CC Comment =Additional isoforms seem to exist ; 

CC Name=l; 

CC IsoId=Q9Y6V0-l; Sequence=Displayed; 

CC Name =2 ; 

CC IsoId=Q9Y6V0-2; Sequence=VSP_003 923 , VSP_003924, VSP_003925, 

CC VSP_003 92 6 / VSP_003927; 

CC Note=No experimental confirmation available; 

CC -!- DOMAIN: C2 domain 1 is involved in binding calcium and 

CC phospholipids. Calcium binds with low affinity but with high 

CC specificity and induces a large conformational change. 

CC -!- SIMILARITY: C C2 domains. KNY 2003 

CC -!- SIMILARITY: Contains 1 PDZ/DHR domain. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL out station - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib. ch) . 

CC 

DR EMBL; Y19188; CAB60727.1; 

DR EMBL; AC004903; AAD20936.1; 

DR EMBL; AC004886; AAD21789.1; 

DR EMBL; AB01113 1 ; 'BAA25485 . 1 ; 

DR EMBL; BC001304; AAH01304.1; 

DR EMBL; AC004082; AAB97937.1; 

DR PIR; T00634; T00634 . 

DR HSSP; P04410; 1A2 5. 

DR Genew; HGNC : 134 06; PCLO. 

DR MIM; 604918; -. 

DR GO; GO: 0005856; C : cytoskeleton ; NAS . 

DR GO; GO: 0045202; C: synaptic junction; ISS. 

DR GO; GO: 0005509; F: calcium ion binding activity; ISS. 

DR GO; GO: 0005544; F : cal cium-dependent phospholipid binding acti. . .; ISS. 

DR GO; GO: 0005522; F:profilin binding activity; ISS. 

DR GO; GO: 0007010; P : cytoskeleton organization and biogenesis; ISS. 

DR GO; GO:0016080; P : synapt ic ves icl e targeting; ISS. 

DR InterPro; IPR000008; 02. 

DR InterPro; IPR001565; Synaptotagmin . 

DR PRINTS; PR00360; C2DOMAIN. 

DR PRINTS; PR00399; SYNAPTOTAGMN . 

DR SMART; SM0023 9; C2 ; 2. 

DR PROSITE; PS00499; C2_DOMAIN_l; 1. 

DR PROSITE; PS50004; C2_DOMAIN_2 ; 2. 

KW Calcium/phospholipid-binding; Zinc; Metal -binding; Zinc-finger; 

KW Repeat; Alternative splicing. 

FT NON_TER 1 1 

FT DOMAIN 400 465 10 X 10 AA TANDEM APPROXIMATE REPEATS OF 

FT P-A-K-P-Q-P-Q-Q-P-X. 

FT ZN_FING 4 99 523 C4 -TYPE (POTENTIAL) . 



r 1 


Z.JM MNb 


969 


992 


C4-TYPE (POTENTIAL) . 


i_.ii 1 1 


NUN LUNb 


1010 


1011 




r 1 


Pl/~MVT A T M 

duiyiaijni 




2325 


POLY -PRO . 


FT 


DOMAIN 


4391 


4442 


PDZ . 


FT 


DOMAIN 


4544 


4633 


C2 DOMAIN 1 . 


FT 


DOMAIN 


5031 


5121 


C2 DOMAIN 2 . 


FT 


VARSPLIC 


4404 


4404 


S -> SGNGLGI RI VGGKEI PGHSGE I GAYI AKI LPGGSAE 


FT 








QTGKLMEG (in isoform 2) . 


FT 








/FTId=VSP_003923 . 


FT 


VARSPLIC 


4534 


4534 


K -> KPTDGTKWSH P I TGE I Q (in isoform 2) . 


FT 








/FTId=VSP_003924 . 


FT 


VARSPLIC 


4576 


4576 


G -> GQVMWQNAS (in isoform 2) . 


FT 








/ r lia=vbP UOJyzb. 


FT 


VARSPLIC 


4757 


4761 


TAHKS -> SKRRK (in isoform 2) . 


FT 








/FTId=VSP_003926. 


FT 


VARSPLIC 


4762 


5147 


Missing (in isoform 2) . 


FT 








/FTId=VSP_003927. 


SQ 


SEQUENCE 


5147 


AA; 563537 


MW; CD5D84990498CD3C CRC64 ; 


Query Match 




5.4%; 


Score 320; DB 1; Length 5147; 



Best Local Similarity 21.8%; Pred . No. 0.00013; 

Matches 260; Conservative 152; Mismatches 440; Indels 340; Gaps 61 

Qy 8 PLVSSSDSPPRPQPAFKYQFVREPEDEEEEEEEEEEDEDEDLEELEVLERKPAAGLSAAP 67 

Db 260 PSLPSPSKPPIQQPTPGKPPAQQPGHEKSQPG PAKPPAQPSGLT 3 03 

Qy 68 VPTAPAAGA PLMDFGNDFVP PAPRGPLPAAPP V 100 

III hi I I I II II 

Db 304 KPLAQQPGTVKPPVQPPGTTKPPAQPLG--PAKPPAQQTGSEKPSSEQPGPKALAQPPGV 361 

Qy 101 --APERQPSWDPSPVSSTVPAPSPLS AAAVSPSKLPEDDEPPA RPP 144 

I =11 I : I I Ih II hi | :|h :|| 

Db 3 62 GKTPAQQPG-PAKPPTQQVGTPKPLAQQPGLQSPAKAPGPTKTPAQTKPPSQQPGSTKPP 420 

Qy 145 P - - P P PAS VS PQAE PVWTPPAPAP - AAP PSTPAAPKRRGSSGAWXXXXKI MDLKEQPGN 201 

I I M III Ih I H II 
Db 421 PQQPGPAKPSPQQPGSTKPPSQQPGSAKPSA QQPSP 456 

Qy 2 02 TISAGQEDFPSVLLETAASXPSLSPLSAASFKEHEYLGNLSTVLPTEGTLQENVSEASKE 261 

: h I : :| I I : I i = I h I I = I 

Db 457 AKPSAQQ- FTKPVSQTGFGKPLQPPTVSPSAKQPPSQGLPKTI CPL CNTTELLLH 510 

Qy 262 VSEKA KTLLI DRDLTEFSE LEYSEMGSSFSVSPKA 296 

Db 511 VPEKANFNTCTECQTTVCSLCGFNPNPHLTEAKEWLCLNCQMKRALGGDLAPVPSSPQPK 570 

Qy 297 ESAVIVANPREEI IVKNKDEEEKLVSNNILHXQQEL PTALTKLVKED 343 

Ml ::|: : III' |:| | |||: 

Db 571 L KTAP VTTTSA VS KS S P Q PQQTS P KKDAA P K QDLSKAPEPKKPPPLVKQP 620 

Qy 344 EWSSEKAK DSFNEKRVAVEAPMREE YADFKPFERVWEVKDSKEDSDM 391 

: I M II - I : I II II : I I :h 

Db 621 TLHGSPSAKAKQPPEADSLSKPAPPKEPSVPSEQDKAPVADDKP KQPKMVKPTTDL 676 



Qy 



3 92 LAAGGKI ESNLESKVD KKCFADSLEQTNHEKDSESSNDDTSFPSTPEGIKDRSGAYI 448 



Db 


677 


VSS SSATTKPDI PSSKVQSQAEEKTTPPLKTDSAKPSQSFPPTGEKV 


723 


Qy 


449 


TCAPFN PAATES I ATN I F PLLE DPTSENXTDEKKI EEKKAQI VTEKN 


495 


Db 


724 


lh lh 1 - | | || || I MM 
- -TPFDSKAI PRPASDSKI ISHPGPSSESKGQKQVDPV QKKEEPKKAQTKMSPK 


775 


Qy 


496 


TSTK TSNPFFVAAQDSETDYVTTD NLTKVTEEWANMPEGLT 


537 


Db 


776 


1 1 1 1 1 : M =|= | : 1 
PDAKPMPKGSPTPPGPRPTAGQTVPTPQQSPKPQEQSRRFSLNLGSITD APKSQPTT 


832 


Qy 


538 


PDLVQEACESELNEVTGTKIAYETKMDLVQTSEVMQESLYPA- -AQLCPSFEESEATPSP 
III III : : M :: : | :| | : | hi 


595 


Db 


833 


P QET VTGKL FGFGAS I - F S QAS NL I S TAGQ PG P HS Q SG PGAPMKQA - PA P 


880 


Qy 


596 


VLPDIVMEAPLNSA-VPSAGASVI QPSSSPLEASSVNYESI K- -HEPENPPPY 


645 


Db 


881 


I 1- III 1 | : : || : : : | : ||| 
SQPPTSQGPPKSTGQAPPAPAKSIPVKKETKAPAAEKLEPKAEQAPTVKRTETEKKPPPI 


940 


Qy 


646 


EEAMSVSLKVSGI KEE I KEPENINAALQETEAPYI S I ACDLI K- ETKLSAEPAPDFSDYS 


704 


Db 


941 


KDSKSLT AEPQKAVLPTKLEKSPKPESTCPLCKTELNIGSKDPPNFNTCT 


990 


Qy 


705 


EMAKVE QP VPDHSELVE DSS PDSE P VDLFSDDS 


737 


Db 


991 


1 MM: | | : h : :: | 
ECKNQVCNLCGFNPTPHLTENCQTQRAI SGQLGDI RKMPPAPSGPKAS PMPVPTESSSQK 


1050 


Qy 


738 


- - 1 PDVPQ KQD ETVML- - VKESLTETSFESMI EYENKEKLSALPPEGG 


781 


Db 


1051 


1 II IM 1 IM IIMh |: : |:: | | : 
TAVP - - PQVKLVKKQEQEVKTEAEKVILEKVKETLSMEKI PPMVTTDQKQEESKLEKDKA 


1108 


Qy 


782 


KPYLESFKLSLDNTKDTLLPDEVSTLSKKEKI PLQMEELSTAVYSNDDLFISKEAQIRET 
1 1 1 1 hhl : Ml II hi M = lh 


841 


Db 


1109 


SALQE - - KKPLPEEK- KLIPEE-EKI RSEEKKPLLEEKKPTP EDKKLLPEAKTSAP 


1160 


Qy 


842 


ETFSD SSPIEI IDE-FPTLISSKTDSFSKLAREYTDLEVSHKSEIANAPDGA-GSLP 


896 


Db 


1161 


1 1 = = l M =: II 1 :::: | | ||| 
EEQKHDLLKSQVQIAEEKLEGRVAPKTVQEGK QPQTKMEGLPSGTPQSLP 


1210 


Qy 


897 


CTELPHDLSLKNI QPKVEEKISFSDDFSKNGSATSKVLLLPPDVSALGHTQ 

• 1 = 1 1 II 1 : h :-h 1 M |- 


947 


Db 


1211 


KED DKTTKTI KEQPQPPCTAKPDQEKEDDKSDTSSSQQPKSPQGLSDTGYSSDGI SS 


1267 


Qy 


948 


- - AE I ES I VKPKVLEKEAEKKLPSDTEKEDRS PSA I FSADLGK- - TSWDLL 995 




Db 


1268 


M 1- lh 1 1 h - Mh Ml 1 = :| = M 
SLGEI PSLIPTD- -EKDILKGLKKDSFSQESSPSS- -PSDLAKLESTVLSIL 1315 





RESULT 12 
PCLO_MOUSE 

ID PCLO_MOUSE STANDARD; PRT; 503 8 AA . 

AC Q9QYX7; Q9QYX6; Q9QZ JO ; 

DT 28-FEB-2003 (Rel . 41, Created) 

DT 28-FEB-2003 (Rel. 41, Last sequence update) 

DT 28-FEB-2003 (Rel. 41, Last annotation update) 

DE Piccolo protein (Presynaptic cytomatrix protein) (Aczonin) (Brain- 
DE derived HLMN protein) . 
GN PCLO OR ACZ . 



OS Mus musculus (Mouse) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Rodent ia; Sciurognathi ; Muridae; Murinae; Mus. 

OX NCBI_TaxID=1009 0; 

RN [1] 

RP SEQUENCE FROM N.A., SUBCELLULAR LOCATION, ALTERNATIVE SPLICING, 

RP TISSUE SPECIFICITY, AND INTERACTION WITH PROFILIN. 

RC TISSUE=Brain; 

RX MEDLINE=99439764; PubMed=10508862 ; 

RA Wang X. , Kibschull M . , Laue M.M., Lichte B., Petrasch-Parwez E., 

RA Kilimann M.W. ; 

RT900XJ£czonin, ^ifiSWr^dHpangtCv^ gtta£f BlcJiE^apgoffeinJafGpresynapt iDOaOTYSfe 66 

RT zones, shares homology regions with rim and bassoon and binds 

RT profilin. "; 

RL J . Cell Biol. 147:151-162(1999). 

RN [2] 

RP REVISIONS. 

RC TISSUE=Brain ; 

RA Kilimann M. ; 

RL Submitted (AUG-2001) to the EMBL/ GenBank/DDB J databases. 

RN [3] 

RP SEQUENCE OF 4502-4682 FROM N . A. 

RC TISSUE=Brain ; 

RL Submitted (OCT-1999) to the EMBL/ GenBank/DDB J databases. 

CC -!- FUNCTION: May act as a scaffolding protein involved in the 

CC organization of synaptic active zones and in synaptic vesicle 

CC trafficking. 

CC -!- SUBUNIT: Interacts with Rabacl/Pral and profilin. 

CC -!- SUBCELLULAR LOCATION: Concentrated at the presynaptic side of 

CC synaptic junctions. 

CC -!- ALTERNATIVE PRODUCTS: 

CC E-Gent=Altaftnatt$#§ splicing; Named ibcbifidrmg=ealcium and 

CC Name=l; 

IsoId=Q9QYX7-l ; Sequence=Displayed; 

CC Name=2 ; 

cc IsoId=Q9QYX7-2; Sequence=VSP_0 03 928 , VSP_003929; 

CC -!- TISSUE SPECIFICITY: Highly expressed in brain. Low levels found in 

CC stomach. Not detected in other tissues analyzed including adrenal 

CC gland, testis and pancreas. 

CC -!- DOMAIN: C2 domain 1 is 

CC phospholipids. Calcium binds with low affinity but with high 

CC specificity and induces a large conformational change. 

CC -!- SIMILARITY: Contains 2 C2 domains. 

CC -!- SIMILARITY: Contains 1 PDZ/DHR domain. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib . ch) . 

CC 6 eH g 2 r 

DR EMBL; Y19185; CAB60731.2; -. 

DR EMBL ; Y19186; CAB60732.2; -. 

DR EMBL; AF181269; AAD55786.2; -. 



DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
KW 
KW 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
SQ 



HSSP; P04410; 1A25. 
MGD; MGI: 1349390; Pclo. 

GO; GO: 0045202; C: synaptic junction; IDA. 

GO; GO: 0005509; F:calcium ion binding activity; ISS. 

GO; GO:0005544; F : calcium- dependent phospholipid binding acti. . . ; ISS . 

GO; GO: 0005522; F:profilin binding activity; IDA . 

GO; GO:0007010; P : cytoskel eton organization and biogenesis; IDA. 

GO; GO: 0016080; P: synaptic vesicle targeting; NAS . 

InterPro; IPR000008; C2 . 

InterPro; IPR001478; PDZ . 

Pfam; PF00168; C2 ; 2. 

Pfam; PF00595; PDZ ; 1. 

SMART; SM0023 9; C2 ; 2. 

SMART; SM00228; PDZ; 1. 

PROSITE; PS 0 04 99; C2_DOMAIN_l; 1. 

PROSITE; PS50004; C2_DOMAIN_2 ; 2. 

PROSITE; PS50106; PDZ; 1. 

Calcium/phospholipid-binding; Metal -binding; Zinc; Zinc-finger; 
Repeat; Alternative splicing. 

10 X 10 AA TANDEM APPROXIMATE REPEATS OF 
P-A-K-P-Q-P-Q-Q-P-X. 
C4 -TYPE (POTENTIAL) . 
C4 -TYPE (POTENTIAL) . 
POLY -PRO. 
PDZ. 

C2 DOMAIN 1 . 
C2 DOMAIN 2 . 

TKPTN -> SKRRK (in isoform 2) . 
/FTId=VSP_003928 . 
Missing (in isoform 2) . 
/FTId=VSP_003929 . 
547600 MW; DADA46 0CF3B4 08 88 CRC64 ; 



DOMAIN 

ZN_FING 

ZN_FING 

DOMAIN 

DOMAIN 

DOMAIN 

DOMAIN 

VARSPLIC 

VARSPLIC 



371 

502 
967 
2305 
4394 
4607 
4922 
4829 

4834 



470 

526 
990 
2329 
4488 
4705 
5012 
4833 

5038 



SEQUENCE 503 8 AA; 



Query Match 5.3%; Score 315; DB 1; Length 503 8; 

Best Local Similarity 22.0%; Pred. No. 0.00019; 

Matches 254; Conservative 136; Mismatches 435; Indels 328; Gaps 



57; 



Qy 


57 


Db 


288 


Qy 


104 


Db 


348 


Qy 


156 


Db 


408 


Qy 


210 


Db 


457 


Qy 


7 


Db 


517 



I I 



I =1 



-PSWDPSP-VSSTVPAPSPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQ-A 155 
1= I I :| II h I III I :| I I || 



:| 



III 



II 



I I I 



II I 



Qy 2 97 ESAVIV ANPREEI IVKN KDEEEK LVSNNILHX QQE 331 

I hh'h I I I I II || I 

Db 577 ASVQPATASKSPVPSQQASPKKELPSKQDSPKAPESKKPPPLVKQPTLHGPTPATAPQPP 636 

Qy 332 LPTALTKLV--KEDEWSSEKAKDSFNEKRVAVEAPMREEYADFKPFERVWEVKDSKEDS 38 9 

'< M I h III lh II =| : : - || 

Db 637 VAEALPKPAPPKKPSAALPEQAK APV ADVEPKQP- -KTTETLTDS 679 

Qy 390 DMLAAGGKIESNLESKVDKKCFADSLEQTNHEKDSESSNDDTSFPSTPEGIKDRSGAYIT 449 

II = I hi 1=1 -h III I : I 

Db 68 0 PSSAAATSKPAILSSQVQ AQAQVTTAPPLKTDSAKTSQSFPPTGD T 725 

Qy 450 CAPFN PAATES I ATNI FPLLEDPTS ENXTD - EKKI EEKKAQ I VTEKNTSTKTSKf 502 

I ^ lh I == lltt= =|| I I I I I ||K 
Db 726 I TPLDSKAMPRPASDSKI VSH PGPTSESKDPVQKKEEPKKAQTKVTPKPDTK 777 

Qy 503 PFFVAAQDSETDYVTTDNLTKVTEEV VANMPEGLTPDLVQEACE 54 6 

Db 778 PVPKGSPTPSGTRPTTGQATPQSQQPPKPPEQSRRFSLNLGGIADAPKS-QPTTPQET- - 834 

Qy 547 SELNEVTGTKIAYETKMDLVQTSEVMQESLYPAAQLCPSFEESEATPSPVLP DIVM 602 

III : : | | :: I I I : III I 

Db 835 VTGKLFGFGASI -FSQASNLIS TAGQQAPHPQTGPAAPSKQAP P PSQTLAA 884 

Qy 603 EAPLNSA- -VPSAGASVIQ PSSSPLEASSVNYESI KHEP ENPP 643 

= 11 III I h: Ml :: || | || 

Db 8 85 QGPPKSTGQHPSAPAKTTAVKKETKGPAAENLEAKPAQAPTVKKAEKDKKHPPGKVSKPP 944 

Qy 644 PYEEAMSVSLKVSGI KEEI KEPENINAALQETEAPYI SI ACDLI K- ETKLSAEPAPDFSD 702 

I XS Ml I = = || h | : :1a |:|:ter. 

Db 945 P TEPEKAVLAQKPDKTTKPKPACPLCRTELNVGSQDPPNFNT 986 

Qy 7 03 YSEMAKVE QPVPDHSELVE DSSPDSEPVD 731 

: I I I :|= I I I : || 

Db 987 CTECKNQVCNLCGFNPTPHLTEIQEWLCLNCQTQRAISGQLGDMDKMPPASSGPKASPVP 1046 

Qy 732 LFSD DS I PDVPQKQDETV MLVKESLT 757 

= = :hh = = I : :| 

Db 1047 APAEPPPQKTPTAAHAKGKKKETEVKAETEKQIPEKETPSIEKTPPAVATDQKLEESEVT 1106 

Qy 758 ETSFESMI EYENKEKLSAL PPEGGKPYLESFKLSLDNTKDTLLPDEVSTLSK 809 

: I : h II II I II I hi I II : h 

Db 1107 KSLVSVLPEKKPSEEEKALPADKKEKKPPAAEAPPLEEKKPI PDDQK--LPPDAKPSASE 1164 

Qy 810 KEKIPLQMEELSTAVYSNDDLFISKEAQIR-ETETFSDV EI IDEFPTLISSKEBKFS 868 

h : : I I :: I I I : I I hi- | = 
Db 1165 GEE KRDL.LKAHVQI PEEGP IGKVASLACEGEQQPDTRPEDLPGATPQTLPKD 1216 

Qy 869 KLARE YTDLEVSHKSE I ANAPDGAG - - SLPCTELPHDLS - - LKN I QPKVEEKI S FSDDFS 924 

h :|= hill II M : III : :| =| 

Db 1217 RQKESRDVTQPQAEGTAKEGRGEPSKDRTEKEEDKSDTSSSQQPKSPQGLS-DTGYS 1272 

Qy 925 KNGSATSKVLLLPPDVSALGHTQAEIESIVKPKVLEKEAEKKLPSDTEKEDRSPSAIFSA 984 

: I : I M II 1 = = lh |||=:= Mh = 

Db 1273 SDGISGS LG E I PSLI PSD- - EKDLLKGLKKDSFSQESSPSS - - PS 1313 

Qy 985 DLGK- -TSWDLL 995 



Db 1314 DLAKLESTVLSIL 1326 



RESULT 13 
ANK2 HUMAN 



ID AN K2 _HUMAN STANDARD; PRT; 3 924 AA. 

AC Q01484; Q01485; 

DT 01-APR-1993 (Rel . 25, Created) 

DT 01-OCT-1996 (Rel. 34, Last sequence update) 

DT 28-FEB-2003 (Rel. 41, Last annotation update) 

DE Ankyrin 2 (Brain ankyrin) (Ankyrin B) (Ankyrin, nonerythroid) . 

GN ANK2 . 

OS Homo sapiens (Human) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

OX NCB I _Tax I D = 9 6 0 6 ; 

RN [1] 

RP SEQUENCE FROM N.A. (ISOFORMS 1; 2 AND 3) . 

RC TISSUE-Brain stem; 

RX MEDLINE=91302466; PubMed= 183 0053 ; 

RA Otto E . , Kunimoto M., McLaughlin T. , Bennett V. ; 

RT "Isolation and characterization of cDNAs encoding human brain 

RT ankyrins reveal a family of alternatively spliced genes ; 

RL J. Cell Biol. 114:241-253(1991). 

RN [2] 

RP REVISIONS . 

RA Carpenter S . ; 

RL Submitted (MAY-1999) to the EMBL/GenBank/DDB J databases. 

RN [3] 

RP SEQUENCE FROM N.A. (ISOFORM 1) . 

RC TISSUE=Brain stem; 

RX MEDLINE=94075409; PubMed=8253844 ; 

RA Chan W. , Kordeli E. , Bennett V. ; 

RT "44 0-kD ankyrinB: structure of the major developmental ly regulated 

RT domain and selective localization in unmyelinated axons . " ; 

RL J. Cell Biol. 123:1463-1473(1993). 

RN [4] 

RP SEQUENCE OF 4 63-4 95 FROM N.A. 

RX MEDLINE=92009921; PubMed-18333 08 ; 

RA Tse W.T., Menninger J.C., Yang-Feng T.L., Francke U. , Sahr K.E., 

RA Lux S.E., Ward D.C., Forget B.G. ; 

RT "Isolation and chromosomal localization of a novel nonerythroid 

RT ankyrin gene . " ; 

RL Genomics 10:858-866(1991). 

CC -!- FUNCTION: Attach integral membrane proteins to cytoskeletal 
CC elements. Also bind to cytoskeletal proteins 

CC - ! - ALTERNATIVE PRODUCTS : 

CC Event ^Alternative splicing; Named isoforms=3; 

CC Name=l; 

CC IsoId=Q01484-l ; Sequence-Displayed; 

CC Name=2; 

CC IsoId=Q01484-2 ; Sequence=VSP_000267, VSP_000268; 

CC Name=3 ; 

CC IsoId=Q01484-3 ; Sequence=VSP_000268 ; 

CC -!- TISSUE SPECIFICITY: PLASMA MEMBRANE OF NEURONS AS WELL AS GLIAL 
CC CELLS THROUGHOUT THE BRAIN. 





-!- PTM: PHOSPHORYLATED AT MULTIPLE SITES BY DIFFERENT PROTEIN KINASES 


L-L 


AND EACH PHOSPHORYLATION EVENT REGULATES THE PROTEIN'S STRUCTURE 


LL 


AND FUNCTION (POTENTIAL) . 




r*n 


-!- SIMILARITY: Contains 23 ANK 


repeats . 


LL 


-!- SIMILARITY: Contains 1 death 


domain. 


CC 
CC 






This SWISS-PROT entry is copyright. It is produced through a collaboration 


CC 


between the Swiss Institute of 


Bioinf ormatics and the EMBL outstation - 


cc 


the European Bioinf ormat ics Institute. There are no restrictions on its 


cc 


use by non-profit institutions as long as its content is in no way 


cc 


modified and this statement is not removed. Usage by and for commercial 


cc 


entities requires a license agreement (See http://www.isb-sib.ch/announce/ 


cc 


or send an email to 32Y 00X8 


sib.ch)MIDl 094559 


cc 

DR 






EMBL ; X56957; CAA40278.1; 




DR 


EMBL ; X56958; CAA40279.2; -. 




DR 


EMBL; Z26634; CAB42644.1; -. 




DR 


EMBL; M37123; AAA62828.1; -. 




DR 


PIR; S37431; S37431. 




DR 


HSSP; P42771; 1DC2 . 




DR 


Genew; HGNC:4 93; ANK2 . 




DR 


MIM; 106410; - . 




DR 


InterPro; IPR002110; ANK. 




DR 


InterPro; IPR000488; Death. 




DR 


InterPro; IPR000906; ZU5 . 




DR 


Pfam; PF00023; ank; 23. 




DR 


Pfam; PF00531; death; 1. 




DR 


Pfam; PF00791 ; ZU5 ; 1 . 




DR 


PRINTS; PR01415; ANKYRIN. 




DR 


SMART; SM00248; ANK; 22. 




DR 


SMART; SM00005; DEATH; 1. 




DR 


SMART; SM00218; ZU5 ; 1. 




DR 


PROSITE; PS50088; ANK REPEAT; 20 




DR 


PROSITE; PS50297; ANK REP REGION; 1. 


DR 


PROSITE; PS50017; DEATH DOMAIN; 


1. 


KW 


Cytoskeleton; Alternative splicing; Repeat; ANK repeat; 


KW 


Pho sphory 1 a t ion . 




FT 


REPEAT 63 92 ANK 


1. 


FT 


REPEAT 96 125 1200XA K 


2. VAPAPAX 


FT 


REPEAT 129 158 ANK 


3 . 


FT 


REPEAT 162 191 ANK 


4 . 


FT 


REPEAT 193 22 0 ANK 


5. 


FT 


REPEAT 232 261 ANK 


6. 


FT 


REPEAT 265 294 ANK 


7 . 


FT 


REPEAT 298 327 ANK 


8. 


FT 


REPEAT 331 360 ANK 


9. 


FT 


REPEAT 364 393 ANK 


10. 


FT 


REPEAT 397 426 ANK 


11. 


FT 


REPEAT 430 459 ANK 


12 . 


FT 


REPEAT 4 63 4 92 ANK 


13 . 


FT 


REPEAT 4 96 525 ANK 


14 . 


FT 


REPEAT 52 9 558 ANK 


15. 


FT 


REPEAT 562 591 ANK 


16. 


FT 


REPEAT 595 624 ANK 


17 . 


FT 


REPEAT 628 657 ANK 


18 . 


FT 


REPEAT 661 690 ANK 


19 . 


FT 


REPEAT 694 723 ANK 


20 . 



FT 


REPEAT 


727 


756 


ANK 21 . 


FT 


REPEAT 


760 


789 


ANK 22 . 


FT 


REPEAT 


793 


822 


ANK 23 . 


FT 


DOMAIN 


1773 


1950 


REPEAT-RICH REGION. 


FT 


REPEAT 


1773 


1784 


REPEAT A. 


FT 


REPEAT 


1785 


1796 


REPEAT A. 


FT 


REPEAT 


1797 


1808 


REPEAT A. 


FT 


REPEAT 


1809 


182 0 


REPEAT A. 


FT 


REPEAT 


1821 


1832 


REPEAT A. 


FT 


REPEAT 


1833 


1844 


REPEAT A. 


FT 


REPEAT 


1845 


1856 


REPEAT A. 


FT 


REPEAT 


1857 


1867 


REPEAT A (APPROXIMATE) . 


FT 


REPEAT 


1868 


1879 


REPEAT A. 


FT 


REPEAT 


1880 


1891 


REPEAT A. 


FT 


REPEAT 


1892 


1902 


REPEAT A (APPROXIMATE) . 


FT 


REPEAT 


1903 


1914 


REPEAT A. 


FT 


REPEAT 


1915 


1926 


REPEAT A. 


FT 


REPEAT 


1927 


1938 


REPEAT A. 


FT 


REPEAT 


1939 


1950 


REfiffiHT A. | : 


FT 


DOMAIN 


3536 






FT 


VARSPLIC 


1039 


1039 


Q -> QFLGKLHLPTAPPPLNEGESLVSRILQLGPPGTK 


FT 








(in isoform 2) . 


r 1 








/FTId=VSP 000267. 


FT 


VARSPLIC 


1444 


3528 


Missing (in isoform 2 and isoform 3) . 


FT 








/FTId=VSP_000268 . 


FT 


CONFLICT 


475 


476 


GQ -> PE (IN REF . 4) . 


FT 


CONFLICT 


971 


971 


I -> S (IN REF. 1) . 


FT 


CONFLICT 


3581 


3582 


QY -> HA (IN REF. 1) . 


FT 


CONFLICT 


3586 


3586 


I -> Y (IN REF. 1) . 


SQ 


SEQUENCE 


3924 


AA; 430337 


MW; 52AC496C428E29D2 CRC64 ; 



oox 



Query Match 5.0%; Score 299; DB 1 ; Length 3 924; 

Best Local Similarity 21.5%; Pred. No. 0.00059; 

Matches 258; Conservative 167; Mismatches 417; Indels 358; Gaps 62; 



QY 
Db 

Qy 
Db 

Qy 

Db 



14 DSPP- 



I 



- RPQPAFKYQFV$EPEDEEEEE$$EEEDED$DL$ EE$F^ERK 5 8 9 8 B 0XT 



164 8 DIP PDETQS TQKQHKPSLG I KKP VRRKLKEKQKQKEEGLQASAEKAELKKGS S EESLGED 1707 

59 PAAGLSAAPVPTAPAAGAPLMD FGNDFVPPAPRG 92 

I Ih HI i HI- | :| :| 

1708 P - - GLAPE PLPTVKAT - S PLI EETP I GS I KDKVKALQKRVEDEQKGRSKLP I RVKGKEDV 1764 

93 PLPAA-PPVAPERQPSWDPSP VSSTVPAPSPL 123 

Mill: II III II Ih 

1765 PKKTTHRPHPAASPSLKSERHAPGSPSPKTERHSTLSSSAKTERHPPVSPSSKTEKHSPV 1824 



Qy 

Db 

Qy 

Db 

Qy 



124 SAAA VSPSKLPEDDEPPA RPPPPPPASVSPQAEPVW 159 

I : I I I II I I I II h : : M 

182 5 SPSAKTERHSPASSSSKTEKHSPVSPSTKTERHSPVSSTKTERHPPVSPSGKTDKRPPV- 1883 

160 TPPAPAPAAPPSTPAAPKRR GSSGAWXXXXKIM TISAGQAdFPSVLILE 216 

: I II =1 ==l II I II :| - I 

1884 SPSGRTEKHPPVSPGRTEKRLPVSPSGRTD KHQPVSTAGKTEKHLPVSPSG 1934 



217 TAASXPSLSPLSAAS-FKEHEYLGNLSTVLPT- 
I =11 I :| : I 



- EGTLQEN VS EAS KE VS E KAKT 268 

■ I h I hill 



1935 KTEKQPPVS PTSKTER I EETMS VRELMKAFQSGQD PS KHKTGLFEHKSAKQKQ PQEKGKV 1994 



269 LLIDRDLTEFSELEYSEMGSSFSVSPKAES- -AVIVANPREEI IVKNKDE- - 316 

= 1 h = =1 = = I h ||| | |:: | || :| 

1995 RVEKEKGPILTQRE-AQKTENQTIKRGQRLPVTGTAESKRGVRVSS IGVKKEDAAG 2049 

317 -EEKLVSNNI LHXQQELPTALTKLVKE DEW 346 

: lh:|: I I - 1 = 1 h I ||: 

2 05 0 GKEKVLSHKI PEPVQS VPEEESHRESEVPK- - EKMADEQGDMDLQI SPDRKTSTDFSEVI 2107 

347 SSE-KAKDSFNEKRVAVEAPMREEYAD FKPFERVWEVKDSKED 388 

I : I : : l = : I : : I I I : : |:: 

2108 KQELEDNDKYQQFRLS EETEKAQLHLDQVLTS PFNTTF PLD YMKDEFLPALSLQSGALDG 2167 

389 - SDMLAAGGKI ESNLES KVDKKCFADSLEQTNHE KDSESSNDDT FPSTPEGI KDRS 444 

2168 SSESLKNEGVAGSPCGSLMEGTPQISSEESYKHEGIA^^ 2225 

445 GAY I TCAPFNPAATES I ATNI F PLLEDPTS ENXTDEKKI EEKKAQ I VTEKNTSTKTSNPF 504 

I = I I I lh- I ||= I : : Ml- 

2226 G ETKESTKTETTTEIRSEKEHPTTKDITGGS- -EERGATVTEDSETSTESFQKE 2277 

5 05 FVAAQDSETDYVTTDNLTKVTEEWA-NMPEGLTPDLVQEACESELNEVTGTKIAYETKM 563 

: I hi : I I III HI I h h = | 

2278 ATLGSPKDTSPKRQDDCTGSCSVALAKETPTGLT EEAACDEGQRTFGSS -AHKT- - 2330 

564 DLVQTSEVMQESLYPAAQLCPSFEESEATPSPVL PDI VMEAP- -LNSAV 610 

M III : :|::| | | | |: :| | |: 

2331 QTDSEAQES TATSDETKALPLPEASVKTDTGTESKPQGVIRSPQGLELAL 238 0 

1 PSAGASVIQ PSSSPLEASSVNYESIKHE-PEN- -PPPYEEA-MSVSLKVBCE $582 

I I = h I I I I I - h | = : | | :|: ||: | 

2381 PSRDSEVLSAVADDSLAVSHKDSLEASPVLEDNSSHKTPDSLEPSPLKESPCRDSLESSP 2440 

658 IKEEIKE PEN - - I NAALQETE - - APYI S I ACDLI KETKLSAEPAPDFSDYSEMAKVE 710 

- : : I I = = Ih HI h |: = = || | :| :| 

2441 VEPKMKAGIFPSHFPLPAAVAKTELLTEVASVRSRLLRD PDGS - -AEDDSLE 2490 

711 QPVPDHSELVEDS SPDSEPVDLFSDDSI PDVPQKQDETVMLVKESLTETSFESMI 765 

I ■ hi I llh : I : I :: I | 
2491 Q TSLMESSGKSPLSPDTPSSEEVSYEVTPKTTDVSTPKPAVIHECAEED 2539 

766 EYENKEKLSALPPEGGKPYLESFKLSLDNTKDTLLPDEVSTLSKK EKI PLQMEELS 821 

,=IIH N 111= II * 11= :h MM 

2540 DSENGEKKRFTPEE T EMFKMV TKI KMF - DELEQEAKQKRDYKKEPKQEESSS 2589 

822 TAVYSNDDLF I SKEA- QI RETETFSDSSPI EI IDEFPTLI SSKTDSFSKLAREYTDLEVS 880 
• hi I = = = I : II : I h : h : | : 
2590 S SDPDADCS VDVDEPKHTGSGEDESGV PVLVTSESRKVSSSSES 2633 

881 HKSEIANAPDGAGS LPCTELPHDLSLKN IQPKVEEKISF- -SDD 922 

o _ ' hi M I | : || : : || | == =| :=| 

2634 -EPELAQLKKGADSGLLPEPVIRVQPPSPLPSSMDSNSSPEEVQFQPWSKQYTFKMNED 2692 

923 FS KNGSATSKA/LLLP PDVSALGHTQAE I ES I VKPKVLEKEAEKKLPSD TEKEDRS 977 

2693 TQEEPGKSEE EKDSESHIAEDRHAVST^EDRS 2725 



RESULT 14 
MAPB HUMAN 



ID MAPB__HUMAN STANDARD; PRT; 2468 AA . 

AC P46821; 

DT 01-NOV-1995 (Rel . 32, Created) 

DT 01-NOV-1995 (Rel. 32, Last sequence update) 

DT 16-OCT-2001 {Rel. 40, Last annotation update) 

DE Microtubule-associated protein IB (MAP IB) [Contains: MAPI light chain 

DE LCI] . 

GN MAP1B . 

OS Homo sapiens (Human) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

OX NCBI_TaxID=9606; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC TISSUE=Fetal brain; 

RX MEDLINE=95104835; PubMed=78 062 12 ; 

RA Lien L.L., Feener C. , Fischbach N. , Kunkel L.M.; 

RT "Cloning of human microtubule-associated protein IB and the 

RT identification of a related gene on chromosome 15."; 

RL Genomics 22:273-280(1994). 

CC -!- FUNCTION: THE FUNCTION OF BR PS IS ESSENTIALLY UNKNOWN . 

CC PHOSPHORYLATED MAP1B MAY PLAY A ROLE IN THE CYTOSKELETAL CHANGES 

CC THAT ACCOMPANY NEURITE EXTENSION. POSSIBLY MAP1B BINDS TO AT LEAST 

CC TWO TUBULIN SUBUNITS IN THE POLYMER, AND THIS BRIDGING OF SUBUNITS 

CC MIGHT BE INVOLVED IN NUCLEATING MICROTUBULE POLYMERIZATION AND IN 

CC STABILIZING MICROTUBULES. 

CC -!- SUBUNIT: 3 DIFFERENT LIGHT CHAINS , LCI, LC2 AND LC3 , CAN ASSOCIATE 
CC WITH MAP1A AND MAP1B PROTEINS. 

CC -!- DOMAIN: Has a highly basic region with many copies of the sequence 
CC KKEE and KKEl/V, repeated but not at fixed intervals, which is 

CC responsible for the binding of MAP1B to microtubules. 

CC -!- PTM: LCI is coexpressed with MAP1B. It is a polypeptide generated 
CC from MAP1B by proteolytic processing. It is free to associate with 

CC both MAP1A and MAP1B. It interacts with the amino terminal region 

CC of MAP1B (By similarity) . 

CC -!- SIMILARITY: TO MAP1A. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib . ch) . 

CC 

DR EMBL; L06237; AAA18904.1; -. 

DR Genew; HGNC:6836; MAP1B. 

DR MIM; 157129; -. 

DR GO; GO: 0005875; C : microtubule associaEMBL/omplex; TAS . 

DR InterPro; IPR000102; MAPlB_neuraxin . 

DR Pfam; PF00414; MAPlB_neuraxin ; 10. 

DR PROSITE; PS00230; MAP1B_NEURAXIN ; 6. 

KW Microtubules; Repeat; Phosphorylation. 



FT CHAIN ? 2468 MAPI LIGHT CHAIN LCI . 

FT REPEAT 1878 1894 MAP1B 1. 

FT REPEAT 1895 1911 MAP1B 2. 

FT REPEAT 1912 1928 MAP1B 3. 

FT REPEAT 1929 1945 MAP1B 4. 

FT REPEAT 1946 1962 MAP1B 5. 

FT REPEAT 1963 1979 MAP1B 6. 

FT REPEAT 1997 2013 MAP1B 7. 

FT REPEAT 2014 2030 MAP1B 8. 

FT REPEAT 2031 2047 MAP1B 9. 

FT REPEAT 2048 2064 MAP1B 10. 

FT DOMAIN 589 790 LYS-RICH (HIGHLY BASIC, CONTAINS MANY 

FT KKEE AND KKEl/V REPEATS) . 

SQ SEQUENCE ■ 2468 AA; 270618 MW; 540839CBDF09D461 CRC64 ; 

Query Match 5.0%; Score 296.5/ DB 1; Length 24 68; 

Best Local Similarity 21,5%; Pred. No. 0.00041; 

Matches 264; Conservative 148; Mismatches 459; Indels 355; Gaps 

Qy 2 EDLDQSPLVSSS-DSPPRPQPAFKYQ FVREP EDEE 

Db 956 EDGEEHVCVSASKHSPTEDEESAKAEADAYIREKRESVASGDDRAEEDMDEAIEKGEAEQ 

Qy 3 6 EEEEEEEEDEDEDLEELEV-LERKPAAGLSAAPVPTAPAAGAPLMDFGNDFVPPAPRGPL 

I hi I I I II :| | | 

Db 1016 SEEEADEEDKAEDAREEEYEPEKMEAEDYVMAVVDKAAEAGGAEEQYGFLTTPTKQLG- - 

Qy 95 PAAPPVAPERQPSWDPSPVSSTVPAPSPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQ 

: I hh I hi I II hi I h 

Db 1074 AQS PGR EPA - -SSI HDETLPGGSESEATA SDEENREDQPEEFTATSGYT 

Qy 155 AEPVWTPPAPAPAAPPSTPAAPKRRGSSGAWXXXXKIMDLKEQPGNTISAGQE D 

I I III :| : II II 

Db 1121 QSTIEISSEPTPMDEMSTP RDVMSDETNNEETESPSQEBFSYSK 

Qy 210 FPSVLLETAASXPS -LSPLSAASFKE HEYLGNLSTVLPTEGTLQENVSEAS 

= 11 I h -Ih II =1 : Ih I :: I - 

Db 1165 YESSLYSQEYSKPADVTPLN- -GFSEGSKTDATDGKDYNASASTISPPSSMEEDKFSRSA 

Qy 260 KE VS E KAKTLL I DRDLTE F SELEYSEMGS SFS 

M M I I =1 III: H HI 

Db 1223 LRDAYCSEV- -KASTTLDIKDSISAVSSEKVSPSKSPSLSPSPPSPLEKTPLGERSVNFS 

Qy 2 92 VSP KAESAVI VANPR- -EEI IVKN- -KDEEEKL VSNNILHX 

= =l 11= I = 1 :|- - | = = I h = I 

Db 12 81 LTPNE I KVSAEAE VAP VS PEVTQE WE EHCAS PEDKTLE WS PSQS VTGSAGHTP Y YQS P 

QY 32 9 QQELPTALTKLVKEDEWSSEKAKDSFNEKRVAVEAPMREEYADFK-PFERVWEVK 

Ml = : I I I : |= :| I I I hh| 

Db 1341 TDEKSSHLPTEV- - IEKPPAVPVSFEFSDAKDENERASVSPMDEPVPDSESPIEKVLSPL 

Qy 3 84 DSKEDSDMLAAGGKI ESNLESKVDKKCFADSLEQTNHEKDSESSNDDTSFPSTPEGIKDR 

I • 111=11 I || : : | | : 
Db 13 99 RS PPLIGSESAYESFLSA- -DDKASGRGAESPFEEKSGKQGSPDQVSPVSE 

Qy 444 SGAYITCAPFNPAATESIATNIFPLLEDPTSENXTDEKK IEEKK A 



59; 
35 

1015 
94 

1073 
154 
1120 
209 

116a50XG 

259 

1222 

291 

1280 

328 

1340 

383 

1398 

443 

1447 

488 



Db 1448 MTSTSLYQDKQEGKSTDFAPIKEDFGQEKKTDDVEAMSSQPALALDERKLGDVSPT 1503 

Qy 48 9 QI VTEKNTSTKTSNPFFVAAQDSETDYVTTDNLTKVTEEWA--N 531 

II -I I h= I I =1 I = = h II : 

Db 1504 QIDVSQFGSFKEDTKMSI SEGTVSDKSATP- -VDEGVAEDTYSHMEGVASVSTASVATSS 1561 

Qy 532 MPEGLTPDLVQEACESELNEVTGTKIAYETKMDLVQTSEVMQES-LYPAAQLCP 584 

II III : : h I- : :||| ||: : |= : || 

Db 1562 FPEPTTDD-VSPSLHAEVGSPHSTEVDDSLSVSWQTPTTFQETEMSPSKEECPRPMSIS 1620 

Qy 585 SFEESEATPSPVLP-DIVMEAPLNSAVPSAGASVI 618 

I I : =1 II ::| : I = If h 
Db 1621 P PDFS PKTAKSRTP VQDHRS EQSSMS I EFGQES PEQSLAMDFSRQS PDH PTVGAGVL 1677 

QY 619 Q PSSSPLEASSVNYESIKHE- -PENPPPY EEAMSVS LKVSGIKEEIKE 664 

h h hi: I || | :||| :: | 

Db 1678 HITENGPTEVDYSPSDMQDSSLSHKI PPMEEPSYTQDNDLSELISVSQVEASPSTSSAHT 1737 

QY 665 PENINAALQETE AP YISIACDLIKE TKLSAEP A 697 

I I = III II I h : - ||| : 

Db 1738 PSQIASPLQEDTLSDVAPPRDMSLYASLTSEKVQSLEGEKXjSPKSDISPLTPRESSPLYS 1797 

Qy 698 PDFSDYSEMAKVEQPVPDHSELVEDSSPDSEPVDLFS 734 

I III : I h II Ml h| | 
Db 1798 PTFSDSTSAVK- EKTATCHS SSSP PIDAASAEPYGFRASVLFDTMQHHLALNR 1849 

QY 73 5 DDSIPDVP QKQDETVMLVKESLTETSFESMI EYENKEKLSALPPE 779 

Ml: || :|| | : | :||: || : 

Db 1850 DLSTPGLEKDSGGKTPGDFSYAYQKPEET TRS PDEEDYDYES YEKTTRTSDV 1901 

Qy 99 0 GGKP YLESFKLSLDNTKDTLLPDEVSTLSKKEKI PLQMEELSTAVYSNDDLF I SKEAQI R 839 

M I I : : : h I 1=1 II I I : | | : 

Db 1902 GGY - YYEKI ERTTKS PSDSGYSYE - -TI GKTTKTP EDGDYSYE- -I IEKTTRTP 1950 

QY 840 ETETFSDSS P I EI I DEFPTLI SSKTDS FS KIAREYTDLEVSHKSE I ANAPDGAGSLP 896 

I : I II M I I : I h :|:| I : 
Db 1951 EEGGYSYD ISEKTTSPPEVSGYSYEKTERSRRLLDDISNGYDDS 1994 

Qy 897 CTELPHDLSLKNI QPKVEEKI SFS DDF SKNG SATS KVLLLPPDVSALGHTQAE - I ES I VK 955 

: II : : I I I : II II I : || | 

1995 -EDGGHTLGDPSYSYETTEKITSFPESEGYSYETSTKTTRTPDTSTYCYETAEKITRTPQ 2 053 

Qy 956 PKVLEKE AEKKLPSDTEKE 974 

I I I I I Ih - 

Db 2054 ASTYSYETSDLCYTAEKKSPSEARQD 2079 

RESULT 15 
MAPB_RAT 

ipAccMASBoRAT STANDARD; PR A . 

AC P15205; Q62958; Q9ER21; Q9QW92; 

DT 01-APR-1990 (Rel . 14, Created) 

DT 16-OCT-2001 (Rel. 40, Last sequence update) 

DT 28-FEB-2003 (Rel. 41, Last annotation update) 

DE Microtubule-associated protein IB (MAP IB) (Neuraxin) [Contains: MAPI 
DE light chain LCI] . 
GN MAP1B . 



OS Rattus norvegicus (Rat) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata ; Euteleostomi ; 

OC Mammalia; Eutheria ; Rodentia; Sciurognathi ; Muridae; Murinae; Rattus. 

OX NCB I_TaxI D= 1 0 1 1 6 ; 

RN [1] 

RP SEQUENCE OF 1-142 FROM N.A. 

RC STRAIN-Sprague-Dawley; TISSUE=Test is ; 

RX MEDLINE=96257242; PubMed=8 6662 95 ; 

RA Liu D. , Fischer I.; 

RT "Isolation and sequencing of the 5 1 end of the rat microtubule- 

RT associated protein (MAP1B) -encoding cDNA."; 

RL Gene 172:307-308(1996). 

RN [2] 

RP SEQUENCE OF 96-2459 FROM N.A., DOMAIN, AND INDUCTION . 

RC STRAIN=Sprague-Dawley; TISSUE=Brain, and Glial tumor; 

RX MEDLINE=92347374; PubMed=163 9 092 ; 

RA Zauner W. , Kratz J., Staunton J. , Feick p., Wiche G. ; 

RT "Identification of two distinct microtubule binding domains on 

RT recombinant rat MAP IB." ; 

RL Eur. J. Cell Biol. 57:66-74(1992). 

RN [3] 

RP SEQUENCE OF 1541-2459 FROM N.A., AND TISSUE SPECIFICITY. 

RC TISSUE=Spinal cord; 

RX MEDLINE=90059871; PubMed=2555 15 0 ; 

RA Rienitz A., Grenningloh G., Hermans -Borgmeyer I. # Kirsch J., 

RA Littauer U.Z., Prior P., Gundelfinger E.D., Schmitt B. , Betz H. ; 

RT "Neuraxin, a novel putative structural protein of the rat central 

RT nervous system that is immunologically related to microtubule- 

RT associated protein 5."; 

RL EMBO J. 8:2879-2888(1989). 

RN [4] 

RP DEVELOPMENTAL STAGE, AND PHOSPHORYLATION. 

RX MEDLINE-97405699; PubMed-9260743 ; 

RA Ma D. , Nothias F., Boyne L.J., Fischer I.; 

RT "Differential regulation of microtubule-associated protein IB (MAP1B) 

RT in rat CNS and PNS during development . " ; 

RL J. Neurosci . Res. 49:319-332(1997). 

CC -!- FUNCTION: The function of brain MAPS is essentially unknown. 
CC Phosphorylated MAP1B may play a role in the cytoskeletal changes 

CC that accompany neurite extension. Possibly MAP1B Binds to at least 

CC two tubulin subunits in the polymer, and this bridging of subunits 

CC might be involved in nucleating microtubule polymerization and in 

CC stabilizing microtubules. 

CC -!- SUBUNIT: 3 different light chains, LC1 # LC2 and LC3 , can associate 
CC with MAP1A and MAP1B proteins. 

CC -!- TISSUE SPECIFICITY: Nervous system (spinal cord, brain stem, 

CC cerebellum and cerebrum) . Not expressed in liver, spleen, kidney, 

CC heart or muscle. 

CC -!- DEVELOPMENTAL STAGE: In cerebral cortex, spinal cord and sciatic 
CC nerve levels are high early in development but decrease during 

CC postnatal development and are low in adults. In dorsal root 

CC ganglia levels remain high throughout development. 

CC -!- INDUCTION: By nerve growth factor. 

CC -!- DOMAIN: Has a highly basic region with many copies of the sequence 
CC KKEE and KKEI/V, repeated but not at fixed intervals, which is 

CC responsible for the binding of MAP1B to microtubules. 

CC -!- PTM: LCI is coexpressed with MAP1B . It is a polypeptide generated 
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FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
SQ 



from MAP1B by proteolytic processing. It is free to associate with 
both MAPI A and MAP1B . It interacts with the amino- terminal region 
of MAP1B (By similarity) . 

- PTM: Phosphorylated. 

- SIMILARITY: TO MAP1A. 

- CAUTION: A C-terminal fragment of this protein (residues 1597 to 
2459) was originally described as neuraxin in Ref.3. 

This SWISS-PROT entry is copyright. It is produced through a collaboration 
between the Swiss Institute of Bioinf ormatics and t EMBL outstation - 
the European Bioinf ormatics Institute. There are no restrictions on its 
use by non-profit institutions as long as its content is in no way 
modified and this statement is not removed. Usage by and for commercial 
entities requires a license agreement (See http://www.isb-sib.ch/announce/ 
or send an email to license@isb-sib . ch) . 

EMBL; U52950; AAB17068.1; -. 
EMBL; X60370; CAC16162.1; -. 
EMBL ; X16623; CAA34620.1; ALT_SEQ. 
PIR; A56577; A56577 . 

InterPro; IPR000102; MAPlB_neuraxin . 
Pfam; PF00414; MAPlB_neuraxin ; 10. 
PROSITE; PS00230; MAP 1 B_NEURAX I N ; 8. 
Microtubules ; Repeat ; Phosphorylation. 
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REPEAT 

REPEAT 

REPEAT 

REPEAT 

REPEAT 

REPEAT 

REPEAT 

REPEAT 

REPEAT 

REPEAT 

DOMAIN 

DOMAIN 

DOMAIN 

CONFLICT 

CONFLICT 

CONFLICT 

CONFLICT 

SEQUENCE 



o 

1869 
1886 
1903 
1920 
1937 
1954 
1988 
2005 
2022 
2039 
559 
588 

2224 
127 
140 
2112 
2169 



2459 
1885 
1902 
1919 
1936 
1953 
1970 
2004 
2021 
2038 
2055 
1035 
786 

2312 
127 
140 
2112 
2169 



MAPI LIGHT CHAIN LCI. 



MAP1B 
MAP1B 
MAP1B 
MAP1B 
MAP1B 
MAP1B 
MAP1B 
MAP1B 
MAP1B 
MAP1B 



1. 
2. 
3. 
4 . 
5. 
6. 
7. 
8. 
9. 
10. 



GLU-RICH. 

LYS-RICH (HIGHLY BASIC, CONTAINS MANY 

KKEE AND KKEl/V REPEATS) . 

LYS-RICH. 

M -> V (IN REF. 1) . 
T -> S (IN REF. 1) . 
R -> K (IN REF . 3) . 
L -> I (IN REF. 3) . 



2459 AA; 269497 MW; 2E3F6872DEDB8BA2 CRC64; 



Query Match 4.9%; Score 289; DB 1; Length 2459; 

Best Local Similarity 20.9%; Pred. No. 0.0008; 

Matches 275; Conservative 158; Mismatches 462; Indels 422; Gaps 



61; 



Qy 
Db 

Qy 

Db 



3 0 EPEDEEEEEEEEEEDEDEDLEELEVLERKPAAGLSAAPVPTAPAAGAPLMDFGNDFVPPA 89 
I I II! Illh - II :: I I I I II : |h 
1004 EAEQSEEEGEEEEDKAEDAREEDHEPDKTEAEDYVMAVVDKAAEAGVTEDQY- -DFL 1058 

90 PRGPLPAAPP- -VAPERQPSWDPSPVSSTVPAPSPLSAAAVSPSKLPEDDEPPARPPPPP 147 

II I ! hh I hi I II hi | 

1059 ---GTPAKQPGVQSPSREPA-- SSI HDETLPGGSESEATA SDEENREDQPEEF 1106 



Qy 14 8 PASVSPQAEPWTPPAPAPAAPPSTPAAPKRRGSSGAWXXXXKIMDLKEQPGNTISAGQ 2 07 

1= : M III =1 : I I I 

Db 1107 TATSGYTQSTIEISSEPTPMDEMSTP RDVMTDETNNEETESPSQ 1150 

QY 208 E DFPSVLLETAASXP SLSPLSAASFKE HEYLGNLSTVLPTEGTLQENV 255 

I =11 I I I : II I : H : lh | 

Db 1151 EFVNITKYESSLYSQEYSKPWASFNGLSDGSKTDATDGRDYNASASTISPPSSMEEDKF 1210 

Qy 256 SEAS KEVSEK AKTLL I DRDLTE 277 

I = = = h I I : : || | :| : 

Db 1211 SKSALRDAYRPEETDVKTGAELDI KDVSDERLS PAKS PSLS PS P PS P I EKTPLGERSV- N 12 69 

Qy 278 FS ELEYSEMGSSFS-VSPKAESAVI VANPRE E 1 1 VKNK 314 

II I - I I : : | | | | | : \ \ | | : : : : 29NBF 50XS 
Db 1270 FSLTPNEIKASAEGEATAWSPGVTQAWEEHCASPEEKTLEWSPSQSVTGSAGHTPYY 1329 

Qy 315 DEEEKLVSNNILHXQQELPTALTKLVKEDEWSSE--KAKDSFNEKRVAVEAPMRE 368 

M= - : I : I M I : I I I ||: : :|| | 

Db 1330 QS PTDE KS SHL PTE VTENAQAVP VSFEFTEAKDE-NER- -SSISPMDE 1374 

Qy 3 69 EYADFK- PFERVWE VKDSKED SDMLAAGGKlESNLESKVDKKCFADSL 415 

I = I 1 = 1 = = || H I I = || I I h hi 

Db 1375 PVPDSESPIEKVLSPLRSPPLIGSESAYEDFLSADDKALGRRSESPFEGKNGKQGFSD-- 1432 

Qy 416 EQTNHEKDSESSNDDTSFPSTPEGIKDRSGAYITCAPFNP AATESIATNIFPLLED- 471 

1 = 1 I I II : = hi |:| h I - I | 

Db 1433 KES PVS - DLTSDLYQDKQEEKRAGF I P I KEDFS PEKKASDAE I MSSQSALALDE 1485 

Qy 472 PTSENXTDEKKIEEKKAQIVTEKNTSTKTSNPFFVAAQDSETDYVTTDNLTK 523 

M : = : I =:| I h= I hh I = : 
Db 1486 RKLGGDGS PTQVDVSQFGS FKEDTKMS I SEGTVSDKSATPVDEGAEDT YSHMEGVAS 1542 

Qy 524 VTEEWA--NMPEGLTPDLVQEACESELNEVTGTKIAYETPCMDLVQTSEVMQES-LYPAA 580 

l : II ^ II I I I : H: I- : :||| |h : h 

Db 1543 VSTASVATSSFPEPTTDD-VSPSLHAEVGSPHSTEVDDSLSVSWQTPTTFQETEMSPSK 1601 

QY 581 QLCP SFEESEATPSPVLPDI VMEAPLNSAV P 611 

= M :: : I : : |:| :| | 

Db 1602 EECPRPMSISPPDFSPKTAKSRTPVQDHRSEQSSMSIEFGQESPEHSLAMDFSRQSPDHP 1661 

Qy 612 SAGASVIQ PSSSPLEASSVNYESIKHE- -PENPPPY EEAMS VS - LKVSG 657 

II - h | : |: |: | | | | :||| :: | 

Db 1662 TVGAGMLHITENGPTEVDYSPSDIQDSSLSHKI PPTEEPSYTQDNDLSELISVSQVEASP 1721 

QY 658 IKEEIKEPENINAALQETE APYISIACDLIKE TKLSAEP 696 

I I : I I I : I hi : - III : 

Db 1722 STSSAHTPSQIASPLQEDTLSDWPPRDMSLYASLASEKVQSLEGEKLSPKSDISPLTPR 1781 

QY 697 APDFSDYSEMAK VEQPVPDHSELVEDS 723 

H III : II :| | =: |: 

^ 1782 ESS PT YS PGFSDSTSGAKESTAAYQTSSS P P I DAAAAEP YGFRSSMLFDTMQHHLALSRD 1841 

QY 7 24 SPDSEPVDLFSDD SIPDVP QKQD 746 

llllll: II =1 : 

Db 1842 LTTSSVEKDNGGKTPGDFNYAYQKPESTTESPDEEDYDYESHEKTIQAHDVGGYYYEKTE 1901 

Qy 74 7 ETVML VKESLTETSFESMIEYENKEKLSALPPEGGKPYLESFKLSLDNTKD 7 97 



Db 1902 RTIKSPCDSGYSYETIEKTTKTPEDGGYSCEITEKTTRTPEEGGYSY EISEK 1953 

Qy 798 TLLPDEVS- -TLSKKEKI PLQMEELSTAVYSNDD LFISKEA 836 

I III I I h --l :| I h 

Db 1954 TTRTPEVSGYTYEKTERSRRLLDDISNGYDDTEDGGHTLGDCSYSYETTEKITSFPESES 2013 

Qy 837 QIRETETFSDSSP IEI IDEFPTLISSKTDSFSKLAREYT DL 877 

II I : II I : | | : | || || 

Db 2014 YSYETTTKTTRS PDTSAYCYETMEKI TKTPQASTYS YETSDRCYTPERKS PSEARQDVDL 2073 

Qy 878 EVSH-KSEIANA PDG AGSLPCTELPHDLSLKNIQP KV 913 

I I hi- I = I I I I I = I 

Db 2074 CLVSSCEFKHPKTELSPSFINPNPLEWFAGEEPTEESERPLTQSGGAPPPSGGKQQGRQC 2133 

Qy 914 EEKI SFSDDFSKNGSATSKVLLLPPDVSALGHTQAEI ES I VKPKVLEKEAEKK- LPSD - - 970 

Db 2134 DETPPTSVSESAPSQTDSDV PPETE ECPSITADANLDSEDESETI PTDKT 2183 

Qy 971 TEKEDRSPS A I FSADLGKTSWDLLYWRDI KKTG 1004 

: I I I I I :||| || Ml 

Db 2184 VTYKHMDPPPAPMQDRSPSPRHPDVSMVDPEALAIEQNLGKALKKDLKEKAKTKKPG 2240 



Search completed: January 22, 
Job time : 25.6482 sees 



2004, 16:31:59 



